GenCore version 5.1.6 
Copyright (c) 1993 - 2005 Compugen Ltd. 



OM nucleic - nucleic search, using sw model 
Run on: April 19, 2005, 23:16:46 



Search time 20871 Seconds 

(without alignments) 

11663.984 Million cell updates/sec 



Title: 

Perfect score: 
Sequence : 

Scoring table: 



US-10-032-256A-1 
5024 

1 gcaggaggagccagggcagc . 



, aaaaaaaaaaaaaaaaaaaa 5024 



IDENTITY_NUC 

Gapop 10.0 , Gapext 1.0 



Searched: 4708233 seqs, 24227607955 residues 

Total number of hits satisfying chosen parameters: 

Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 

Post-processing: Minimum Match 0% 

Maximum Match 100% 
Listing first 45 summaries 



9416466 



Database 



GenEmbl 



1 




gb ba:* 


2 




gb htg:* 


3 




gb in:* 


4 




gb om : * 


5 




gb ov:* 


6 




gb pat : * 


7 




gb_ph : * 


8 




gb_pl : * 


9 




gb pr:* 


10: 


gb ro : * 


11: 


gb sts:* 


12: 


gb sy:* 


13: 


gb__un : * 


14: 


gb vi : * 



Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 



SUMMARIES 



% 

Result Query 

No. Score Match Length DB 



ID 



Description 



1 5024 100.0 

2 4821.6 96.0 



5024 10 AF167987 
5026 10 AF055919 



AF167987 Mus muscu 
AF055919 Mus muscu 



3 


3299. 8 


65. 


7 


149947 


10 


AC133505 


AC133505 Mus muscu 


4 


2142 


42. 


6 


2142 


6 


AX754694 


AX754694 


Sequence 


5 


2138 . 8 


42. 


6 


2142 


6 


AX754695 


AX754695 


Sequence 


6 


1861. 8 


37. 


. 1 


247180 


2 


AC108663 


AC108663 


Rattus no 


7 


1750 


34 . 


8 


7385 


9 


HSA271722 


AJ271722 


Homo sapi 


8 


1714 . 6 


34 . 


1 


2145 


6 


AX056395 


AX056395 


Sequence 


9 


1394 . 8 


27 , 


, 8 


1923 


6 


CO731710 


CQ731710 


Sequence 


10 


869 


17 . 


,3 


193142 


2 


AC113861 


AC113861 


Rattus no 


11 


847 . 6 


16. 


, 9 


274928 


2 


AC114717 


AC114717 


Rattus no 


12 


769 


15. 


, 3 


2931 


5 


BC084068 


BC084068 


Xenopus 1 


13 


765 . 8 


15, 


,2 


2931 


5 


AY318877 


AY318877 


Xenopus 1 


14 


704.4 


14. 


, 0 


2499 


5 


AY318878 


AY318878 


Xenopus 1 


15 


522 .2 


10, 


, 4 


181254 


9 


AL954201 


AL954201 


Pan trogl 


16 


522 .2 


10. 


, 4 


197872 


9 


AL954202 


AL954202 


Pan trogl 


17 


510. 6 


10. 


, 2 


63402 


9 


AP000260 


AP000260 


Homo sapi 


18 


510. 6 


10. 


, 2 


100000 


9 


AP000099 


AP000099 


Homo sapi 


19 


510 . 6 


10, 


, 2 


100000 


9 


AP000175 


AP000175 


Homo sapi 


20 


510. 6 


10, 


, 2 


144857 


9 


AP000036 


AP000036 


Homo sapi 


21 


510. 6 


10. 


,2 


340000 


9 


AP001712 


AP001712 


Homo sapi 


22 


482 . 2 


9, 


, 6 


656 


6 


CQ054560 


CQ054560 


Sequence 


23 


482 . 2 


9 , 


, 6 


656 


6 


CQ055015 


CQ055015 


Sequence 


24 


482.2 


9 , 


,6 


656 


6 


CQ055055 


CQ055055 


Sequence 


25 


482.2 


9 , 


. 6 


656 


6 


CQ073803 


CQ073803 


Sequence 


26 


482 . 2 


9 , 


, 6 


656 


6 


CQ074272 


CQ074272 


Sequence • 


27 


482 . 2 


9 . 


. 6 


656 


6 


CQ074313 


CQ074313 


Sequence 


28 


482.2 


9 . 


. 6 


656 


6 


CQ104692 


CQ104692 


Sequence 


29 


482.2 


9 , 


, 6 


656 


6 


CQ105166 


CQ105166 


Sequence 


30 


482.2 


9 , 


. 6 


656 


6 


CQ105207 


CQ105207 


Sequence 


31 


482.2 


9 . 


, 6 


656 


6 


CQ143397 


CQ143397 


Sequence 


32 


482 .2 


9 . 


, 6 


656 


6 


C0143861 


CQ143861 


Sequence 


33 


482 . 2 


9 , 


, 6 


656 


6 


CQ143902 


CQ143902 


Sequence 


34 


482 . 2 


9 , 


, 6 


656 


6 


CQ178890 


CQ178890 


Sequence 


35 


482 . 2 


9 , 


. 6 


656 


6 


CQ179340 


CQ179340 


Sequence 


36 


482 . 2 


9 , 


. 6 


656 


6 


CQ179380 


CQ179380 


Sequence 


37 


482.2 


9, 


. 6 


656 


6 


CQ203234 


CQ203234 


Sequence 


38 


482.2 


9, 


.6 


656 


6 


CQ203701 


CQ203701 


Sequence 


39 


482.2 


9, 


.6 


656 


6 


CQ203742 


CQ203742 


Sequence 


40 


482.2 


9, 


. 6 


656 


6. 


CQ226598 


CQ226598 


Sequence 


41 


482.2 


9, 


. 6 


656 


6 


CQ227065 


CQ227065 


Sequence 


42 


482.2 


9, 


.6 


656 


6 


CQ227106 


CQ227106 


Sequence 


43 


482.2 


9, 


.6 


656 


6 


CQ264740 


CQ264740 


Sequence 


44 


482.2 


9, 


.6 


656 


6 


CQ265212 


CQ265212 


Sequence 


45 


482.2 


9. 


.6 


656 


6 


CQ265253 


CQ265253 


Sequence 



ALIGNMENTS 



RESULT 1 
AF167987 
LOCUS 
2000 

DEFINITION 

ACCESSION 
VERSION 



AF167987 

Mus musculus 
(Hunk) mRNA, 
AF167987 

AF167987.1 GI:7021318 



5024 bp 



mRNA 



linear 



ROD 23-FEB- 



hormonally upregulated neu tumor-associated kinase 
complete cds . 



KEYWORDS 

SOURCE Mus musculus (house mouse) 

ORGANISM Mus musculus 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 
Mammalia; Eutheria; Rodentia; Sciurognathi; Muridae; Murinae; Mus. 
REFERENCE 1 (bases 1 to 5024) 

AUTHORS Gardner, H. P . , Wertheim, G . B . , Ha,S.I., Copeland, N . G. , Gilbert, D . J. , 

Jenkins, N. A. , Marquis, S.T. and Chodosh, L. A. 
TITLE Cloning and characterization of Hunk, a novel mammalian 

SNFl-related protein kinase 
JOURNAL Genomics 63 (1), 46-59 (2000) 
MEDLINE 20130113 
PUBMED 10662544 
REFERENCE 2 (bases 1 to 5024) 

AUTHORS Gardner, H. P. , Wertheim, G. B.W. , Ha,S.I., Copeland, N . G. , 

Gilbert, D. J. , Jenkins, N .A. , Marquis, S.T. and Chodosh, L. A. 
TITLE Direct Submission 

JOURNAL Submitted ( 11- JUL-1999 ) Molecular and Cellular Engineering, 
University of Pennsylvania, 612 BRBII, 421 Curie Blvd., 
Philadelphia, PA 19104-6160, USA 
FEATURES Location/Qualifiers 
source 1. .5024 

/organism="Mus musculus" 
/mol_type="mRNA" 
/strain="FVB" 
/db_xref="taxon: 10090" 
/ chromosome="16" 

/map="linked to App, Tiaml and Erg" 

/dev_stage="day 14 embryo" 
gene 1. .5024 

/gene="Hunk" 
CDS 72. .2216 

/gene="Hunk" 

/note="serine/threonine protein kinase" 
/codon_start=l 

/product="hormonally upregulated neu tumor-associated 
kinase" 

/protein_id="AAF35282 . 1" 
/db_xref="GI : 7021319" 

/translation="MPAAAGDGLLGEPAAPGGDGGAEDTTRPAAACEGSFLPAWVSGV . 

SRERLRDFQHHKRVGNYLIGSRKLGEGSFAKVREGLHVLTGEKVAIKVIDKKRAKKDT 

YVTKNLRREGQIQQMIRHPNITQLLDILETENSYYLVMELCPGGNLMHKIYEKKRLDE 

AEARRYI RQLI SAVEHLHRAGWHRDLKI ENLLLDEDNNI KLI DFGLSNCAGI LGYSD 

PFSTQCGSPAYAAPELLARKKYGPKIDWSIGVl^AMLTGTLPFTVEPFSLRALYQK 

MVT)KAMNPLPTQLSTGAWFLRSLLEPDPVTCRPNIQQALANRWLNE 

PNRISLEDLSPSWLHMTEKLGYKNSDVINTVLSNRACHILAIYFLLNKKLERYLSGK 



SDIQDSICYKTQLYQIEKCRATKEPYEASLDTWTRDFEFHAVQDKKPKEQEKRGDFLH 



RPFSKKLDKNLPSHKQPSPSLITQLQSTKALLKDRKASKSGFPDKDSBVCRNLFRKTS 



DSNCVASSSMEFI PVPPPRTPRIVKKLEPHQPGPGSASILPKEEPLLLDMVRSFESVD 

REDHIELLSPSHHYRILSSPVSLARRNSSERTLSQGLLSGSTSPLQTPLHSTLVSFAH 

EEKNSPPKEEGVCSPPPVPSNGLLQPLGSPNCVKSRGRFPMMGIGQMLRKRHQSLQPS 

SERSLDASMSPLQPIAPSSLSFDMADGVKGQC" 
polyA_signal 4980. .4985 
/gene="Hunk" 

ORIGIN 

Query Match 100.0%; Score 5024; DB 10; Length 5024; 

Best Local Similarity 100.0%; Pred. No. 0; 

Matches 5024; Conservative 0; Mismatches 0; Indels 0; Gaps 

0; 

Qy 1 GCAGGAGGAGCCAGGGCAGCCCCGGGAGCCGGAGGAGGAGCGGCTGCGAGCGCGGGAGCC 60 

I II I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I 
Db 1 GCAGGAGGAGCCAGGGCAGCCCCGGGAGCCGGAGGAGGAGCGGCTGCGAGCGCGGGAGCC 60 

Qy 61 GAGCGAGCGCGATGCCGGCAGCGGCGGGGGACGGGCTCTTGGGCGAGCCGGCGGCACCGG 120 

I II I I I I I I I I I I I I I I I II I I I I I I I I I I I I I M I I II I I I I I I I I I I I I I II I I I I II 
Db 61 GAGCGAGCGCGATGCCGGCAGCGGCGGGGGACGGGCTCTTGGGCGAGCCGGCGGCACCGG 120 

Qy 121 GGGGCGATGGAGGCGCGGAGGACACGACCAGGCCGGCGGCGGCCTGCGAGGGAAGTTTCC 180 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 121 GGGGCGATGGAGGCGCGGAGGACACGACCAGGCCGGCGGCGGCCTGCGAGGGAAGTTTCC 180 

Qy 181 TGCCCGCCTGGGTGAGCGGCGTGTCCCGCGAGCGGCTCCGGGACTTCCAGCACCACAAGC 240 

I I I I I I I I I II I II I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 181 TGCCCGCCTGGGTGAGCGGCGTGTCCCGCGAGCGGCTCCGGGACTTCCAGCACCACAAGC 24 0 

Qy 241 GCGTGGGCAACTACCTCATCGGCAGCAGGAAGCTGGGAGAGGGCTCCTTCGCCAAGGTGC 300 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I 

Db 241 GCGTGGGCAACTACCTCATCGGCAGCAGGAAGCTGGGAGAGGGCTCCTTCGCCAAGGTGC 300 

Qy 301 GCGAGGGGCTGCACGTGCTGACGGGAGAAAAGGTAGCTATCAAGGTCATCGATAAGAAAA 360 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 301 G CGAGGGG CT GCACGT GCT GAC GGGAGAAAAGGTAGCTATCAAGGT CAT C GATAAGAAAA 360 

Qy 361 GAGC CAAGAAAGACAC CT ACGT CAC CAAAAAC CT GC GT C GAGAG GGGCAGAT C CAGCAGA 420 

I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 361 GAG C CAAGAAAGACAC CT AC GT CAC CAAAAAC CT GC GT C GAGAGGGGCAGATC CAGCAGA 420 

Qy 421 TGAT CCGACACCCCAACATCACACAGCT CCT GGACAT CTTGGAGACAGAGAACAGCTACT 480 

I II I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I II I I I I I I I I I I I I 

Db 421 T GAT C C GACAC C C CAAC AT CACACAGCT C CT GGACAT CTT GGAGACAGAGAACAGCT ACT 4 80 

Qy 481 ACCTGGTCATGGAGCTGTGTCCTGGTGGCAACCTCATGCACAAGATCTACGAAAAGAAAC 540 

II I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 481 ACCTGGTCATGGAGCTGTGTCCTGGTGGCAACCTCATGCACAAGATCTACGAAAAGAAAC 54 0 

Qy 541 GGTTGGATGAAGCCGAGGCCCGCAGATACATCCGGCAACTCATCTCTGCGGTGGAACACC 600 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I M I I I I 

Db 541 GGTTGGATGAAGCCGAGGCCCGCAGATACATCCGGCAACTCATCTCTGCGGTGGAACACC 600 



Qy 

Db 



601 
601 



TGCACCGTGCGGGGGTGGTTCACAGAGACTTGAAGATAGAGAATTTGCTACTAGATGAAG 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 1 I I I I I I I I I I I I I I I I I I I I I I I I I I 
TGCACCGTGCGGGGGTGGTTCACAGAGACTTGAAGATAGAGAATTTGCTACTAGATGAAG 



660 
660 



Qy 661 ACAATAATAT CAAG CT GATT GACT T T GGCTT GAGCAACT GT GCAGGGAT C CTAGGTT ACT 720 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 661 ACAATAATAT CAAGCT GAT T GACT T T GGCT T GAG CAACT GT GCAG GGAT C CTAGGTT ACT 720 

Qy 721 CGGATCCATTCAGCACACAGTGTGGCAGCCCTGCCTATGCTGCGCCAGAACTGCTTGCCA 780 

I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I 
Db 721 CGGATCCATTCAGCACACAGTGTGGCAGCCCTGCCTATGCTGCGCCAGAACTGCTTGCCA 780 



Qy 781 GGAAGAAAT AT G GC C C CAAAAT T GAT GT CT GGT CAAT AGGC GT GAACAT GT ATGC CATG C 840 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I II I I I I I I I 
Db 781 GGAAGAAAT AT GGC C C CAAAATTGAT GTCT G GT CAAT AGG C GT GAACAT GT AT GC C AT G C 840 

Qy 841 TGACGGGGACCCTACCTTTCACTGTGGAGCCTTTCAGCCTGAGGGCTCTGTATCAGAAGA 900 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 841 TGACGGGGACCCTACCTTTCACTGTGGAGCCTTTCAGCCTGAGGGCTCTGTATCAGAAGA 900 



Qy 901 TGGTGGACAAAGCAATGAATCCCCTGCCGACCCAGCTCTCCACAGGGGCCGTCAACTTTC 960 

I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I 
Db 901 TGGTGGACAAAGCAATGAATCCCCTGCCGACCCAGCTCTCCACAGGGGCCGTCAACTTTC 960 

Qy 961 TGCGCTCCCTCCTGGAACCAGACCCTGTGAAGAGGCCGAATATCCAGCAAGCGCTGGCGA 

1020 

I I I I I I I I I I I I I I I I II I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 961 TGCGCTCCCTCCT GGAAC CAGAC C CT GT GAAGAGG C C GAATAT C C AG CAAG C GCT G G CGA 

1020 



Qy 1021 AT CGCT GGTT GAAT GAGAATTAC ACT G GAAAGGT GC C CT G CAAT GT C AC CT AT C C CAACA 

1080 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 1021 AT CG CT GGT T GAAT GAGAAT TACACTGGAAAG GT GC C CT GCAAT GT CAC CT ATC CCAAC A 

1080 

Qy 1081 GGAT TT CT T T GGAAGAC CT GAGT C C CAGC GT GGT GCT GCACAT GACT GAAAAGC T G G GCT 

1140 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1081 GGATTTCTTTGGAAGACCTGAGTCCCAGCGTGGTGCTGCACATGACTGAAAAGCTGGGCT 

1140 



Qy 1141 ATAAGAACAGTGACGTCATCAACACGGTGCTCTCCAACCGCGCCTGCCACATCCTGGCCA 

1200 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1141 ATAAGAACAGTGACGTCATCAACACGGTGCTCTCCAACCGCGCCTGCCACATCCTGGCCA 

1200 

Qy 1201 T CTACTT C CT GTT GAACAAGAAACT T GAGCGCTATTT GT CAGGGAAATC AGATAT C CAAG 

1260 

I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 1201 TCT ACTT C CT GT T GAACAAGAAACTT GAGC GCTAT T T GT CAG GGAAAT CAGATAT C CAAG 

1260 



Qy 



1261 AT AGCAT CT GCT ACAAGAC C CAGCT CTAC CAGAT AGAGAAGT GCAGAGC CACCAAGGAGC 



1320 

1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 i 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

Db 1261 AT AGCAT CT GCT ACAAGAC CCAGCT CTAC CAGATAGAGAAGT GCAGAGCCAC CAAGGAGC 

1320 

Qy 1321 CCTATGAGGCCTCCCTGGATACCTGGACGAGGGACTTTGAATTCCATGCTGTGCAGGATA 

1380 

1 II I I I I 1 I I I I I I II I I I I I I I I I I I I I I i I I I I I I I I 1 I I I I I I I I I I I I I I I I I I I I 

Db 1321 CCTATGAGGCCTCCCTGGATACCTGGACGAGGGACTTTGAATTCCATGCTGTGCAGGATA 

1380 

Qy 1381 AAAAGCCCAAAGAACAAGAAAAAAGAGGTGATTTTCTCCACCGTCCGTTTTCCAAGAAGT 

1440 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I II I I I I I 
Db 1381 AAAAGCCCAAAGAACAAGAAAAAAGAGGTGATTTTCTCCACCGTCCGTTTTCCAAGAAGT 

1440 

Qy 1441 T GGACAAGAAC CT G C CT T CT CACAAACAG C CAT C GC C CT CGCT GAT C ACACAGCT C CAGA 

1500 

I I I I I I I II I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 1441 T GGACAAGAAC CT GC CTT CT CACAAACAGC CAT CG C C CT CGCT GAT CACACAGCT C CAGA 

1500 

Qy 1501 GT AC C AAAG C C C T G C T C AAAGAC AG GAAG G C C T C C AAGT C AG GCTTCCCC G AC AAAGAT T 

1560 

I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 1501 GTACCAAAGCCCTGCTCAAAGACAGGAAGGCCTCCAAGTCAGGCTTCCCCGACAAAGATT 

1560 

Qy 1561 CCTTCGTCTGCCGCAATCTTTTCCGAAAAACCTCTGATTCCAATTGTGTGGCTTCTTCTT 

1620 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1561 CCTTCGTCTGCCGCAATCTTTTCCGAAAAACCTCTGATTCCAATTGTGTGGCTTCTTCTT 

1620 

Qy 1621 CCATGGAATTCATCCCTGTCCCACCTCCCAGGACACCAAGGATTGTAAAGAAACTAGAGC 

1680 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 1621 C CAT GGAAT T C AT CCCT GT CC CACCT C CC AGGACAC CAAGGATT GTAAAGAAACT AGAG C 

1680 

Qy 1681 CACACCAACCAGGGCCGGGAAGTGCCAGCATCCTCCCCAAGGAAGAGCCCCTGCTGCTGG 

1740 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1681 CACACCAACCAGGGCCGGGAAGTGCCAGCATCCTCCCCAAGGAAGAGCCCCTGCTGCTGG 

1740 

Qy 1741 ATATGGTACGCTCCTTTGAGTCTGTGGATCGAGAGGACCACATAGAACTGCTGTCCCCTT 

1800 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 1741 ATATGGTACGCTCCTTTGAGTCTGTGGATCGAGAGGACCACATAGAACTGCTGTCCCCTT 

1800 



Qy 

1860 



1801 CT CAC CATT AT AGGATC CT GAGCTC GC CT GT GAGCCT GGCT CGTAGGAATT CTAGT GAGA 



I I I I I I II I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 



Db 1801 CT CAC C ATTAT AGGAT C CT GAG CT C GCCT GT GAGCCT GG CT C GTAGGAATT CT AGT GAGA 

1860 

Qy 1861 GGACACTCTCCCAGGGGCTGCTGTCCGGAAGTACCTCACCTCTCCAAACTCCACTGCATT 

1920 

I I Ml I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I 

Db 1861 GGACACTCTCCCAGGGGCTGCTGTCCGGAAGTACCTCACCTCTCCAAACTCCACTGCATT 

1920 

Qy 1921 CCACGCTGGTCTCTTTTGCCCACGAAGAAAAGAACAGCCCCCCGAAAGAGGAGGGTGTGT 

1980 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 1921 CCACGCTGGTCTCTTTTGCCCACGAAGAAAAGAACAGCCCCCCGAAAGAGGAGGGTGTGT 

1980 

Qy 1981 GTTCACCGCCTCCCGTTCCCAGTAATGGCCTCCTGCAGCCTCTGGGGAGCCCCAACTGTG 

2040 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1981 GTTCACCGCCTCCCGTTCCCAGTAATGGCCTCCTGCAGCCTCTGGGGAGCCCCAACTGTG 

2040 

Qy 2041 TGAAGAGCAGGGGACGGTTCCCCATGATGGGCATCGGACAGATGCTGAGGAAGCGGCACC 

2100 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I 
Db 2041 T GAAGAGC AG G GGAC GGT T C C C CAT GATGGG CAT C GGAC AGAT GCT GAGGAAGC G GCACC 

2100 

Qy 2101 AGAGCCTGCAGCCTTCCTCAGAGAGGTCCCTGGACGCCAGCATGTCCCCTCTGCAGCCCA 

2160 

I I I I II I II I I I I I I I I II I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 2101 AGAGCCTGCAGCCTTCCTCAGAGAGGTCCCTGGACGCCAGCATGTCCCCTCTGCAGCCCA 

2160 

Qy 2161 TAGCCCCCTCCAGCCTCTCCTTTGACATGGCCGACGGTGTCAAGGGCCAGTGTTAACCTG 

2220 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 2161 TAGCCCCCTCCAGCCTCTCCTTTGACATGGCCGACGGTGTCAAGGGCCAGTGTTAACCTG 

2220 

Qy 2221 GGAT GGCAAGATTCT GGGTCTCT GTGAGGACAGCCACGGAACAGAGCT CCACACAGGCAG 

2280 

I I I I I I I I I I I I I I I I I I I I I I I I I I I J I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 2221 GGAT GGCAAGATT CT G G GT CT CT GT GAGGACAGC CAC GGAACAGAGCT C CACACAGGCAG 

2280 

Qy 2281 GCACCAGGGCAT GGGT GAACAAC CT CAC GGGAGCAT C CTTT AT T CT T TT ATAC CT GCCAC 

234 0 

I I I I I I I I II I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 2281 GCACCAGGGCATGGGTGAACAACCTCACGGGAGCATCCTTTATTCTTTTATACCTGCCAC 

2340 

Qy 2341 ACAAAGTCCCACGCTTGTATCAGCTGAAGTCCACACTCAAAGTCCACGCACTTACTTAGG 

2400 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I 

Db 2341 ACAAAGT C C CACGCT T GT AT CAGCT GAAGT CCACACT CAAAGTCCAC G CACTT ACT T AGG 

2400 



Qy 2401 GACCCTCTGAGACGCTGCCACTAGGGGGAGGGGGAGGGGGCAGACTGTGGGAATCACACC 

2460 

I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 2401 GACCCTCTGAGACGCTGCCACTAGGGGGAGGGGGAGGGGGCAGACTGTGGGAATCACACC 

2460 

Qy 2461 TTCCAGCCTGAGATTTTCTTTGCTATCACCAATCACTGAGCCCTCTCCAGGATCCCCTCA 

2520 

I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 2461 TT CC AGCCT GAGATTTT CTT T G CT AT CACCAATCACT GAGC C CT CT CCAGGAT C C C CT CA 

2520 

Qy 2521 GTGGGCTCAGAGCTAAAAACCACACCTCCATCTGCTGGGCCAATCAGATTTCCAGACTGG 

2580 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 2521 GTGGGCTCAGAGCTAAAAACCACACCTCCATCTGCTGGGCCAATCAGATTTCCAGACTGG 

2580 

Qy 2581 TACCAGGTTGTCCCTCCCCTCCTCTCTGTGTGTCTCTCACAGTTCTGTAACTGACCGTCA 

2640 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 2581 TACCAGGTTGTCCCTCCCCTCCTCTCTGTGTGTCTCTCACAGTTCTGTAACTGACCGTCA 

2640 

Qy 2641 GTGGTCAGTTACAGTCTCACGCGGACGTGCCACTCGCTGGTAAGGACGTTCACCCAACCT 

2700 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 2641 GTGGTCAGTTACAGTCTCACGCGGACGTGCCACTCGCTGGTAAGGACGTTCACCCAACCT 

2700 

Qy 2701 AGGGAT C C CT CT ACAGAGGGAAGCAAC CCT C CTTT C C CTAAC AGT GAGT C CC CACAGAGT 

2760 

I I I I I I I I II I I I I I I I I I I I II I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I 

Db 2701 AGG GAT C C CT CT ACAGAGG GAAG CAAC CCTCCTTTCC CTAACAGT GAGT C C C CACAGAGT 

2760 

Qy 27 61 GCT GAGT CACAGT GCTGGACCGGGAGGAAGATGGGATGGCGCCT CAGACAGAGAT GGAAC 

2820 

I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I 
Db 27 61 GCT GAGT CACAGT GCT G GAC C GGGAGGAAGAT GGGAT GGC GC CT CAGACAGAGAT G GAAC 

2820 

Qy 2821 CCAG C AGC GAGAAC C CAGGAGGAAGAC GAAGACT CAAAC GCT CATT C CT GT GCAAC GTTT 

2880 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 2821 CCAGCAGCGAGAACCCAGGAGGAAGACGAAGACTCAAACGCTCATTCCTGTGCAACGTTT 

2880 

Qy 2881 TGACAGATTTTTCTTTCCTCTCTTTCTTTTTCCCCTGACCTTTTCTTCTTTTTGGGTTGA 

2940 

I I I I I I I I I I I I I II I I I I I I I I I I I I I I I II I I I I I I I I I I I I I II I I I I I I I I I I I I I 

Db 2881 TGACAGATTTTTCTTTCCTCTCTTTCTTTTTCCCCTGACCTTTTCTTCTTTTTGGGTTGA 

2940 



Qy 



2941 AACTT GCT GAGGATT GAAC GAACTT GT C CAAAGAGAT CT TTCT T TAT AT GAAGT CATTAA 



3000 

I I I I I I I I I I I I I I 1 1 I I I I I I 1 1 I I 1 1 1 1 1 1 I I I I 1 1 I I I I I I 1 1 I I I I i I 1 1 I I I I I I 

Db 2941 AAC T T G CT GAG GAT T GAAC GAAC T T GT C C AAAGAGAT CT T T CT T TAT AT GAAGT CAT T AA 

3000 

Qy 3001 TTAAATTTTTTTTTTAAAGACAGGGTCTCATTAAGTAGCCCAAGCTGGCTTCAAACTCAT 

3060 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 3001 TTAAATTTTTTTTTTAAAGACAGGGTCTCATTAAGTAGCCCAAGCTGGCTTCAAACTCAT 

3060 

Qy 3061 GAT C CT C CT G C CT CAGC CT C CAAAGT G CT GAGAT T ACAAGT AT AT ACCC GT GT CT GGCTC 

3120 

I I I I I I 1 I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 3061 GAT C CT C CTG CCT CAGC CT C CAAAGTGCT GAGATTACAAGTAT AT ACCC GT GT CT G GCTC 

3120 

Qy 3121 AAAAT AGCAATT CAAAAACAAAAACTAGTT GGCCAGAT GAAAAGTAGT TTTAC CAAAT T C 

3180 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 3121 AAAAT AGCAATT CAAAAACAAAAACTAGTTGGCCAGAT GAAAAGTAGTTTTACCAAATTC 

3180 

Qy 3181 ACGTGTTTTTGTTTTTCTGAGAGGCTGCAGCTCAGATGGCCAAAAAGCTGGCAACAGGAG 

3240 

I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 3181 ACGTGTTTTTGTTTTTCTGAGAGGCTGCAGCTCAGATGGCCAAAAAGCTGGCAACAGGAG 

3240 

Qy 3241 GACCACAGTGGCCTGCCTGCCTAAGGGATAGTAGCCTAGCCATCCTGTGTTTATACCGTG 

3300 

I I I I I I I I I I I I I 1 I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 3241 GACCACAGTGGCCTGCCTGCCTAAGGGATAGTAGCCTAGCCATCCTGTGTTTATACCGTG 

3300 

Qy 3301 GCAG CAGCAGAAG GC AT AGAACTTAGCT C CAGAT GGCT CT GGAGAGAGAGAAAGGAT T CT 

3360 

I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 3301 GCAGCAGC AGAAGGCATAGAACT T AGCT C CAGAT GGCT CT GGAGAGAGAGAAAGGAT T CT 

3360 

Qy 3361 TAAAGCAGAGTTGAGACAGCAAGAAGCAGGGAATTCGCTGTGTCATGCTGTTCTGCCGTG 

3420 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 3361 TAAAGCAGAGTTGAGACAGCAAGAAGCAGGGAATTCGCTGTGTCATGCTGTTCTGCCGTG 

3420 

Qy 3421 GTTAGAACTTAGCTGTTCTGCTGGGAGCTAGGAGCAGGCTTGCCGCCCCCTGGGAACACG 

3480 

I I I I I I I I I I II I I I I I II I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 3421 GTTAGAACTTAGCTGTTCTGCTGGGAGCTAGGAGCAGGCTTGCCGCCCCCTGGGAACACG 

3480 

Qy 3481 CTCACAAGACGGTTCGTCCCCAAAGGAAACAGTGCCCCCCAAACAGGCTTTCAGTCCACT 

3540 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 



Db 3481 CTCACAAGACGGTTCGTCCCCAAAGGAAACAGTGCCCCCCAAACAGGCTTTCAGTCCACT 

3540 

Qy 3541 CT GTAAT CT G CAC CT T C CC CT C CAG GAT T GAAC CAAAGAT GC AT TT C CGGTTTT GT GACT 

3600 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 3541 CTGTAATCTGCACCTTCCCCTCCAGGATTGAACCAAAGATGCATTTCCGGTTTTGTGACT 

3600 

Qy 3601 GTGCCACTCTGTGTGTCTCTTGTGGAACCTGGTGTTGTCTGATCCTGTCCGGCTGGCGCT 

3660 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 3601 GTGCCACTCTGTGTGTCTCTTGTGGAACCTGGTGTTGTCTGATCCTGTCCGGCTGGCGCT 

3660 

Qy 3661 GGATGGAGGACT GT CTCT GT GT GCAT CGT GGGCCCT GGTACTTAGCAGAGGACAAAGGGT 

3720 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I 

Db 3661 GGATGGAGGACT GTCTCTGTGTGCATCGTGGGCCCTGGTACTTAGCAGAGGACAAAGGGT 

3720 

Qy 3721 ACT GT TGT CAGGAGGGGAAGACTT GGCAC G GGCT G GACCACAGTTAGTT T AGAAGT TAT G 

3780 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I 

Db 3721 ACTGTTGTCAGGAGGGGAAGACTTGGCACGGGCTGGACCACAGTTAGTTTAGAAGTTATG 

3780 

Qy 3781 GAACAGCTCAGAATCTTCTGGTCTTTGACTATTTCAGATGGGGTCAGAGACCAGAGCTGT 

3840 

I I I I I I I II II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db ■ 3781 GAACAGCTCAGAATCTTCTGGTCTTTGACTATTTCAGATGGGGTCAGAGACCAGAGCTGT 

3840 

Qy 3841 AG C CAGGAAGCC AGGTT CAT CAT CTT GGT C CAT C GATT CTAAAGT GG G CAAATT T CT GT G 

3900 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 3841 AG C CAGGAAGCC AG GTT CAT CAT CT T GGT C CAT C GATT CTAAAGT G G GCAAATT T CT GT G 

3900 

Qy 3901 ACGTCACAAAGCCGGCCTTTGCCAGTGAGGGCTGAGACACAGTACAACTGCCTCTCATTT 

3960 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 3901 ACGTCACAAAGCCGGCCTTTGCCAGTGAGGGCTGAGACACAGTACAACTGCCTCTCATTT 

3960 

Qy 3961 ACTGGTGGCAGGCGGCTTCCTTTGGCCTCTCAGAGCTCTGACTGAACTAGAAGAGAACAC 

4020 

I I I I I I I I I I I I I I I I I II II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 3961 ACT GGTGGCAGGCGGCTTCCTTTGGCCTCTCAGAGCTCT GACT GAACTAGAAGAGAACAC 

4020 

Qy 4021 GGATTTGGCTGACCCTGGAAGAAAGCTGCTCTAGTCCTGGCTGAATTTGGTAAGACCTGG 

4080 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 4021 GGATTTGGCTGACCCTGGAAGAAAGCTGCTCTAGTCCTGGCTGAATTTGGTAAGACCTGG 

4080 



Qy 4081 ACTACTTAAACCTTAGGGAGGGACTGACTCCCTCCCGAGGACCCATTACAGGAGGAGGCC 

4140 

I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 4 081 ACT ACT TAAACCTT AGGGAG GGACT GACT C C CT CC C GAGGAC CCATT ACAG GAGGAGGC C 

4140 

Qy 4141 AGGCTTTTCTCCCAGAGCTGATGGTGTTCTTCATTCAGCATGGCTTCCGTTCAGCTCCCA 

4200 

I I I II I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I i I I I I I I I I I I I I I I 
Db 4141 AGGCTTTTCTCCCAGAGCTGATGGTGTTCTTCATTCAGCATGGCTTCCGTTCAGCTCCCA 

4200 

Qy 4201 GGACT T GACACT GAAAATAGAACT CTTTAAGCAGAGAGAAGAGGAGAAC CAT CCACAGAC 

4260 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I 
Db 4201 GGACTT GACACT GAAAATAGAACT CTTTAAGCAGAGAGAAGAGGAGAAC CAT CCACAGAC 
4260 

Qy 4261 GCTCCCCGTATTTGATGTGACGTGTTTGAGCTTTGACGGGTGAAGAGTCCTTTTAAAAGA 

4320 

. I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I 1 I I I 
Db 4261 GCTCCCCGTATTTGATGTGACGTGTTTGAGCTTTGACGGGTGAAGAGTCCTTTTAAAAGA 

4320 

Qy 4321 TAACTGCCAGCTGCAGGCATCTGGCTCTGCAAAGCTGGTAGGATGTGTACCTGTGTACTG 

4380 

II I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I II I I I I I 
Db 4321 TAACTGCCAGCTGCAGGCATCTGGCTCTGCAAAGCTGGTAGGATGTGTACCTGTGTACTG 
4380 

Qy 4381 TGCCCGCCCCCTTTCTCCTAGCCCTTTATGTCTTTTTCTGACTGTTTGCTTTTCTCGTAT 

4440 

I I I I I I II I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I 
Db 4381 TGCCCGCCCCCTTTCTCCTAGCCCTTTATGTCTTTTTCTGACTGTTTGCTTTTCTCGTAT 

4440 

Qy 4441 GTATGTGTGCCTGTGTTGGTGCGAGCCTGTGGAGAAAGAGTCTCCCATCCTTCAAATGCT 

4500 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I 
Db 4441 GTATGTGTGCCTGTGTTGGTGCGAGCCTGTGGAGAAAGAGTCTCCCATCCTTCAAATGCT 

4500 

Qy 4501 TCGAGAACAGCGTCAGATGTACAACTAGTTTGCCTGCGTTGCTACTGGTACCTTGGACTC 

4560 

I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I 
Db 4501 TCGAGAACAGCGTCAGATGTACAACTAGTTTGCCTGCGTTGCTACTGGTACCTTGGACTC 

4560 

Qy 4561 T GAACT CAGGTT AC CCAC CT GAGT C CT CAGTAGGCAGTGGAC CCATT GAGAGGCAAATGA 

4620 

I I II I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 4561 TGAACT C AG GTT AC C CAC CT GAGT C CT CAGT AGGCAGT GGAC CCAT T GAGAGGCAAAT GA 

4620 



Qy 



4 621 GAACAGGAGGGAGACAAGCT GT GTT CT GG GGCGCACATAAAC AC CT GACAGACGAGT CT A 



4680 

I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 4621 GAAC AGGAGGGAGACAAG CT GT GTT CT GGGG C GC AC ATAAACACCT GACAGAC GAGT CTA 

4680 

Qy 4681 GGAAAC CG C GT GAAAGAAGAAAT GT TAAATT CTTTATT GT TTTAT TAT AT TT AT AT GGAA 

4740 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 4681 GGAAAC CG C GT GAAAGAAGAAAT GT TAAATT CTTTATT GT TT TATT AT AT TT AT AT GGAA 

4740 



Qy 4741 AATGTGGCTATCCTTTTGTTAAGTGCAGAGTGTATTGTCTGTTTGACCCATGACTGTCCT 

4800 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I II I I I I I I I I I I I I I I I I I I I I I I I 
Db 4741 AATGTGGCTATCCTTTTGTTAAGTGCAGAGTGTATTGTCTGTTTGACCCATGACTGTCCT 

4800 



Qy 4 801 TCATGAATGAGTCTTTGCCTGTGATTCTAGTCAGCCTGTGGCTACTGATGGGAACGGCCG 

4860 

I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 4801 TCATGAATGAGTCTTTGCCTGTGATTCTAGTCAGCCTGTGGCTACTGATGGGAACGGCCG 

4860 



Qy 4 861 AT CTGT CAT CATGT GAAGTCCAGGAGGAAGAATCTATTTTAGTCATACGATTTGGT CATG 

4920 

I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I 
Db 4861 AT CTGT CAT CATGTGAAGTCCAGGAGGAAGAATCTATTTTAGT CATACGATTTGGTCAT G 

4920 

Qy 4921 AGTAAGGACTATATTTAT GT CACCACTATT GAATATATGTACTTTTATAAT GGCTGT GAA 

4980 

I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 4921 AGT AAG GACTAT ATT T AT GT CAC CACTAT T GAAT AT AT GT ACT TT TATAAT GGCT GT GAA 

4980 



Qy 4981 AT ACACT T TTT CCT CACAAAAAAAAAAAAAAAAAAAAAAAAAAA 5024 

I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I 
Db 4 981 ATACACTTTTTCCT CACAAAAAAAAAAAAAAAAAAAAAAAAAAA 5024 
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AF055919 5026 bp mRNA linear ROD 28-APR- 
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REFERENCE 3 (bases 1 to 5026) 

AUTHORS Korobko,I.V. 

TITLE Direct Submission 

JOURNAL Submitted (28-APR-1999 ) Laboratory of Cancer Molecular Genetics, 
Institute of Gene Biology, Russian Academy of Sciences, 34/5 
Vavilov Str., Moscow 117334, Russia 
REMARK Sequence update by submitter 
COMMENT On Apr 28, 1999 this sequence version replaced gi: 3659508. 

FEATURES Location/Qualifiers 
source 1. .5026 

/organism="Mus musculus" 
/mol_type="mRNA" 
/strain="A/Sn" 
/db_xref="taxon: 10090" 
/ cell_line= ,f adenocarcinoma VMR-Liv" 
/tissue_type= "mammary gland" 
/note="high metastatic potential" 
gene 1. .5026 

/gene="Mak-v" 
CDS 60. .2204 

/gene="Mak-v" 
/codon_start=l 

/product="putative serine/threonine protein kinase MAK-V" 
/protein_id="AAC61489 . 1" 
/db_xref="GI: 3659509" 

/trans la tion="MPAAAGDGLLGEPAAPGGDGGAEDTTRPAAACEGSFLPAWVSGV 



SRERLRDFQHHKRVGNYLIGSRKLGEGS FAKVREGLHVLTGEKVAI KVI DKKRAKKDT 
YVTKNLRREGQIQQMIRHPNITQLLDILETENSYYLVMELCPGGNLMHKIYEKKRLDE 
AEARRYI RQLI SAVEHLHRAGWHRDLKI ENLLLDEDNNI KLI DFGLSNCAGI LGYS D 
PFSTQCGSPAYAAPELLARKKYGPKIDWSIGWMYAMLTGTLPFTVEPFSLRALYQK 
MVT)KAMNPLPTQLSTGAWFLRSLLEPDPV7tt*PNIQQALANRW 

PNRISLEDLSPS VVLHMTEKLGYKNSDVINTVLSNRACHILAIYFLLNKKLERYLSGK 
SDIQDSICYKTQLYQIEKCRATKEPYEASLDTWTRDFEFHAVQDKKPKEQEKRGDFLH 
RPFSKKLDKNLPSHKQPSPSLITQLQSTKALLKDRKASKSGFPDKDSFVCRNLFRKTS 
DSNCVASSSMEFIPVPPPRTPRIVKKLEPHQPGPGSASILPKEEPLLLDMVRSFESVD 



REDHIELLSPSHHYRILSSPVSLARRNSSERTLSQGLLSGSTSPLQTPLHSTLVSFAH 



EEKNSPPKEEGVCSPPPVPSNGLLQPLGSPNCVKSRGRFPMMGIGQMLRKRHQSLQPS 

SERSLDASMS PLQPTAPS SLS FDMADGVKGQC " 
misc_f eature 243. .1019 

/gene= M Mak-v" 

/note= f, encodes putative catalytic domain" 

ORIGIN 

Query Match 96.0%; Score 4821.6; DB 10; Length 5026; 

Best Local Similarity 98.8%; Pred. No. 0; 

Matches 4970; Conservative 0; Mismatches 29; Indels 31; Gaps 

AGGGCAGCCCCGGGAGCCGGAGGAGGAGCGGCTGCGAGCGCGGGAGCCGAGCGAGCGCGA 7 2 
I II I I I I I I I I I I I I I I I I I 1 I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I 1 I I I I 
AGGGCAGCCCCGGGAGCCGGAGGAGGAGCGGCTGCGAGCGCGGGAGCCGAGCGAGCGCGA 6 0 

TGCCGGCAGCGGCGGGGGACGGGCTCTTGGGCGAGCCGGCGGCACCGGGGGGCGATGGAG 132 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
TGCCGGCAGCGGCGGGGGACGGGCTCTTGGGCGAGCCGGCGGCACCGGGGGGCGATGGAG 120 

GCGCGGAGGACACGACCAGGCCGGCGGCGGCCTGCGAGGGAAGTTTCCTGCCCGCCTGGG 1 92 

I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

GCGCGGAGGACACGACCAGGCCGGCGGCGGCCTGCGAGGGAAGTTTCCTGCCCGCCTGGG 180 

TGAGCGGCGTGTCCCGCGAGCGGCTCCGGGACTTCCAGCACCACAAGCGCGTGGGCAACT 2 52 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
TGAGCGGCGTGTCCCGCGAGCGGCTCCGGGACTTCCAGCACCACAAGCGCGTGGGCAACT 240 

ACCTCATCGGCAGCAGGAAGCTGGGAGAGGGCTCCTTCGCCAAGGTGCGCGAGGGGCTGC 312 
I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
ACCTCATCGGCAGCAGGAAGCTGGGAGAGGGCTCCTTCGCCAAGGTGCGCGAGGGGCTGC 300 

ACGT GCT GACGGGAGAAAAGGTAGCTATCAAGGT CAT CGATAAGAAAAGAGCCAAGAAAG 372 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
ACGTGCT GACGGGAGAAAAGGTAGCTAT CAAGGT CATCGATAAGAAAAGAGCCAAGAAAG 360 

ACACCTACGTCACCAAAAACCT GCGTCGAGAGGGGCAGATCCAGCAGAT GAT CCGACACC 4 32 

II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

ACAC CTAC GT CAC CAAAAAC CT GCGTC GAGAGG GGCAGAT C CAGCAGAT GAT CC GACAC C 420 

CCAACAT CACACAGCT C CT G GACAT CTT GGAGACAGAGAACAG CT ACTACCT G GT CAT GG 4 92 
I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
C CAACAT C ACACAGCTC CT GGACAT CT T GGAGACAGAGAACAG CTACTAC CT GGT CAT GG 4 80 

AG CT GT GT C CT GGT GGCAAC CT CAT GCACAAGAT CT AC GAAAAGAAAC GGT T GGAT GAAG 552 
I II II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
AGCTGTGTCCTGGTGGCAACCTCATGCACAAGATCTACGAAAAGAAACGGTTGGATGAAG 540 

CCGAGGCCCGCAGATACATCCGGCAACTCATCTCTGCGGTGGAACACCTGCACCGTGCGG 612 

I I I I I I I I I I I I I I I I I II II I I I I I I I I I I I I I I I I I I I I I I I I I I f I I I I I I I I I I I I 
CCGAGGCCCGCAGATACATCCGGCAACTCATCTCTGCGGTGGAACACCTGCACCGTGCGG 600 

613 GGGT GGTTCACAGAGACTTGAAGATAGAGAATTTGCTACTAGAT GAAGACAATAATATCA 672 



10; 




Qy 


13 


Db 


1 


Qy 


73 


Db 


61 


Qy 


133 


Db 


121 


Qy 


. 193 


Db 


181 


Qy 


253 


Db 


241 


Qy 


313 


Db 


301 


Qy 


373 


Db 


361 


Qy 


433 


Db 


421 


Qy 


493 


Db 


481 


Qy 


553 


Db 


541 


Qy 


613 



Db 



I I I I I I I I I I I I I I I i I I I I I I I I I I 1 I I I I I I I I I II I I I I I I I I I I I II I I I I I I I I I 

601 GGGT GGT T CACAGAGACT T GAAGATAGAGAATTT GCTACT AGAT GAAGACAATAAT AT CA 660 



Qy 673 AGCT GAT T GACT T T GGCT.T GAGCAACT GT G CAGGGAT C CTAGGT TACT C GGAT C CAT T C A 732 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I II I I I I I I I 

Db 661 AGCTGATTGACTTTGGCTTGAGCAACTGTGCAGGGATCCTAGGTTACTCGGATCCATTCA 720 

Qy 733 GCACACAGTGTGGCAGCCCTGCCTATGCTGCGCCAGAACTGCTTGCCAGGAAGAAATATG 792 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 721 GCACACAGTGTGGCAGCCCTGCCTATGCTGCGCCAGAACTGCTTGCCAGGAAGAAATATG 780 

Qy 793 GCCCCAAAATTGATGTCTGGTCAATAGGCGTGAACATGTATGCCATGCTGACGGGGACCC 852 

II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 781 GCCCCAAAATTGATGTCTGGTCAATAGGCGTGAACATGTATGCCATGCTGACGGGGACCC 840 

Qy 853 TACCTTTCACTGTGGAGCCTTTCAGCCTGAGGGCTCTGTATCAGAAGATGGTGGACAAAG 912 

I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 841 TACCTTTCACTGTGGAGCCTTTCAGCCTGAGGGCTCTGTATCAGAAGATGGTGGACAAAG 900 

Qy 913 CAATGAATCCCCTGCCGACCCAGCTCTCCACAGGGGCCGTCAACTTTCTGCGCTCCCTCC 972 

I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 901 CAATGAATCCCCTGCCGACCCAGCTCTCCACAGGGGCCGTCAACTTTCTGCGCTCCCTCC 960 

Qy 973 T GGAACCAGAC CCT GT GAAGAGGC CGAAT AT C CAG CAAG C G CT GG C GAAT C G CT G GTT GA 

1032 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I hi I I I I I I I I I I I I I I I I I I I I I I I I I I I II 

Db 961 T GGAAC CAGAC C CT GT GAAGAGG CC GAATAT C CAGCAAGC GCT GGC GAAT C GCT GGT T GA 

1020 

Qy 1033 AT GAGAAT T ACACT G GAAAGGT GCC CT G CAAT GT CAC CT AT CCCAACAGGATTT CTT TGG 

1092 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I IN I I I I I I I I I I I I I I I I 

Db 1021 ATGAGAATTACACTGGAAAGGTGCCCTGCAATGTCACCTATCCCAACAGGATTTCTTTGG 

1080 

Qy 1093 AAGAC CT GAGT CC CAGC GT GGT GCT GCACAT GACT GAAAAGCT GGGCT ATAAGAACAGT G 

1152 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 1081 AAGAC CT GAGT CC CAGC GT GGT GCT GCACAT GACT GAAAAG CT GGGCT ATAAGAACAGT G 

1140 

Qy 1153 ACGTCATCAACACGGTGCTCTCCAACCGCGCCTGCCACATCCTGGCCATCTACTTCCTGT 

1212 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1141 ACGTCATCAACACGGTGCTCTCCAACCGCGCCTGCCACATCCTGGCCATCTACTTCCTGT 

1200 

Qy 1213 TGAACAAGAAACTTGAGCGCTATTTGT CAGGGAAATCAGATAT CCAAGATAGCAT CTGCT 

1272 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I 
Db 1201 T GAACAAGAAACT T GAGCGCTATTT GT CAGGGAAAT CAGAT AT CCAAGATAGCAT CT GCT 

1260 

Qy 1273 ACAAGAC CCAG CT CT AC CAGAT AGAGAAGT GCAGAGC CACCAAGGAGCC CT AT GAGGCCT 

1332 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I 



Db 1261 ACAAGAC CCAGCT CT AC CAGATAGAGAAGTGCAGAGC CACCAAGGAGCC CT AT GAGGCCT 

1320 

Qy 1333 CCCTGGATACCTGGACGAGGGACTTTGAATTCCATGCTGTGCAGGATAAAAAGCCCAAAG 

1392 

I I I 1 I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 1321 CCCTGGATACCTGGACGAGGGACTTTGAATTCCATGCTGTGCAGGATAAAAAGCCCAAAG 

1380 

Qy 1393 AACAAGAAAAAAGAGGT GATTTTCT CCACCGTCCGTTTT CCAAGAAGTTGGACAAGAACC 

1452 

I I I I I I I I I I I I I I I I II I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1381 AAC AAG AAAAAAGAG GT GAT T T T C T C C AC CGTCCGTTTTC C AAGAAGT T G GAC AAGAAT C 

1440 

Qy 1453 TGCCTTCTCACAAACAGCCATCGCCCTCGCTGATCACACAGCTCCAGAGTACCAAAGCCC 

1512 

I I I I I I II I I I I M I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 1441 TGCCTTCTCACAAACAGCCATCGCCCTCGCTGATCACACAGCTCCAGAGTACCAAAGCCC 

1500 

Qy 1513 TGCTCAAAGACAGGAAGGCCTCCAAGTCAGGCTTCCCCGACAAAGATTCCTTCGTCTGCC 

1572 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1501 TGCTCAAAGACAGGAAGGCCTCCAAGTCAGGCTTCCCCGACAAAGATTCCTTCGTCTGCC 

1560 

Qy 1573 GCAATCTTTTCCGAAAAACCTCTGATTCCAATTGTGTGGCTTCTTCTTCCATGGAATTCA 

1632 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1561 GCAATCTTTTCCGAAAAACCTCTGATTCCAATTGTGTGGCTTCTTCTTCCATGGAATTCA 

1620 

Qy 1633 T CCCT GT CCCACCTCCCAGGACACCAAGGATTGTAAAGAAACTAGAGCCACACCAACCAG 

1692 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 1621 TCCCTGTCCCACCTCCCAGGACACCAAGGATTGTAAAGAAACTAGAGCCACACCAACCAG 

1680 

Qy 1693 GGCCGGGAAGTGCCAGCATCCTCCCCAAGGAAGAGCCCCTGCTGCTGGATATGGTACGCT 

1752 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1681 GGCCGGGAAGTGCCAGCATCCTCCCCAAGGAAGAGCCCCTGCTGCTGGATATGGTACGCT 

1740 

Qy 1753 CCTTT GAGTCT GT GGAT CGAGAGGACCACATAGAACT GCTGTCCCCTTCTCACCATTAT A 

1812 

I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II 
Db 1741 CCTTT GAGT CT GT G GAT C GAGAGGACCACAT AGAACT GCTGTCCCCTTCT CAC CAT TATA 

1800 

Qy 1813 GGATCCTGAGCTCGCCTGTGAGCCTGGCTCGTAGGAATTCTAGTGAGAGGACACTCTCCC 

1872 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1801 GGATCCTGAGCTCGCCTGTGAGCCTGGCTCGTAGGAATTCTAGTGAGAGGACACTCTCCC 

1860 



Qy 1873 AGGGGCTGCTGTCCGGAAGTACCTCACCTCTCCAAACTCCACTGCATTCCACGCTGGTCT 

1932 

I I! I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I 1 I I I 
Db 1861 AGGGGCTGCTGTCCGGAAGTACCTCACCTCTCCAAACTCCACTGCATTCCACGCTGGTCT 

1920 

Qy 1933 CTTTTGCCCACGAAGAAAAGAACAGCCCCCCGAAAGAGGAGGGTGTGTGTTCACCGCCTC 

1992 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db . 1921 CTTTTGCCCACGAAGAAAAGAACAGCCCCCCGAAAGAGGAGGGTGTGTGTTCACCGCCTC 

1980 

Qy 1993 CCGTTCCCAGTAATGGCCTCCTGCAGCCTCTGGGGAGCCCCAACTGTGTGAAGAGCAGGG 

2052 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 1981 CCGTTCCCAGTAATGGCCTCCTGCAGCCTCTGGGGAGCCCCAACTGTGTGAAGAGCAGGG 

2040 

Qy 2053 GAC GGTT C C C CAT GAT GG GCAT C G GACAGAT GCT GAGGAAGC G GCAC C AGAGC CT G CAGC 

2112 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 2041 GAC GGTT C C C CAT GAT GGGCAT C GGACAGAT GCT GAGGAAGC GGCACC AGAGC CT G CAGC 

2100 

Qy 2113 CTTCCTCAGAGAGGTCCCTGGACGCCAGCATGTCCCCTCTGCAGCCCATAGCCCCCTCCA 

2172 

I I I I I I I I I I I I I I II I I II I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I 
Db 2101 CTTCCTCAGAGAGGTCCCTGGACGCCAGCATGTCCCCTCTGCAGCCCACAGCCCCCTCCA 

2160 

Qy 2173 GCCTCTCCTTTGACATGGCCGACGGTGTCAAGGGCCAGTGTTAACCTGGGATGGCAAGAT 

2232 

I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 2161 GCCTCTCCTTTGACATGGCCGACGGTGTCAAGGGCCAGTGTTAACCTGGGATGGCAAGAT 

2220 

Qy 2233 TCT GGGTCT CT GT GAGGACAGCCACGGAACAGAGCT CCACACAGGCAGGCACCAGGGCAT 

2292 

I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 2221 TCTGGGTCTCTGTGAGGACAGCCACGGAACAGGGCTCCACACAGGCAGGCACCAGGGCAT 

2280 

Qy 22 93 GGGT GAACAAC CT CACGG GAGCAT CCT TT AT T CT TT T AT AC CT GCCACACAAAGT CCCAC 

2352 

I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I 
Db 2281 GGGT GAACAACCT CAC GGGAG CAT C CT T TAT T CT TT T AT AC CT GCCACACAAAGT C C CAC 

2340 

Qy 2353 GCTTGTATCAGCTGAAGTCCAC — ACTCAAAGTCCACGCACTTACTTAGGGACCCTCTGA 

2410 

I I I I I I I I I I II I I I I I II I I I I I I I I II I I I I I I I I I I II I I I I I I I I I I I I I I I I 
Db 2341 GCTTGTATCAGCTGAAGTCCACAGACTCAAAGTCCACACACTTACTTAGGGACCCTCTGA 

2400 

Qy 2411 GACGCTGCCACTAGGGGGAGGGGGAGGGGGCAGACTGTGGGAATCACACCTTCCAGCCTG 



2470 

II I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 2401 GACGCTGCGACTAGGGGGAGGGGGAGGGGGCGAACTGTGGGAATCACACCTTCCAGCCTG 

2460 

Qy 2471 AGATTTTCTTTGC--TATCACCAATCACTGAGCCCTCTCCAGGATCCCCTCAGTGGGCTC 

2528 

I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 24 61 AGATTTTCTTTGCTATATCACCAATCACTGAGCCCTCTCCAGGATCCCCCCAGTGGGCTC 

2520 

Qy 2529 AGAGCTAAAAACCACAC CTC CAT CT GCTG GGC CAAT CAGATT T C CAGACT G GTACCAGGT 

2588 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II 
Db 2521 AGAGCTAAAAAC C ACAC CTC CAT CT G CT GGG C CAAT CAGATT T C CAGACT GGTACCAGGT 

2580 

Qy 2589 TGTCCCTCCCCTCCTCTCTGTGTGTCTCTCACAGTTCTGTAACTGACCGTCAGTGGTCAG 

2648 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 2581 TGTCCCTCCCCTCCGCTCTGTGTGTCTCTCACAGTTCTGTAACTGACCGTCAGTGGTCAG 

2640 

Qy 2 64 9 TTACAGTCTCACGCGGACGTGCCACTCGCTGGTAAGGACGTTCACCCAACCTAGGGATCC 

2708 

M I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 2641 T TACAGT CT CACGC GGAC GT GC CACT C GCT GGTAAG GACGTT CAC CCAAC CT AG GGAT C C 

2700 

Qy 2709 CT CTACAGAGG GAAG CAACC CT C CTT T CC CTAACAGTGAGT C C CCACAGAGT GCTGAGT C 

2768 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I II I I I I I I I I I I I I I I I II I I 
Db 27 01 CT CTACAGAGGGAAGCAACC C C C CT T T CCCTAACAGT GAGT CC G CAC AGAGT GCTGAGT C 

2760 

Qy 2769 ACAGT GCT GGACC GGGAGGAAGAT GGGAT G GC GC CT CAGAC AGAGAT GGAAC CCAGCAGC 

2828 

I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 2761 ACAGTGCTGGACCGGGAGGAAGAT GGGAT GGCGCCT CAGACAGAGAT GGAACCCAGCAGA 

2820 

Qy 2829 GAGAAC C C AGGAG GAAGACGAAGACT C AAAC GCT CAT T C CT GT GCAAC GTT TT GACA 

2885 

I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I II 

Db 2821 GAGAAC C CAGGAG GAAGACGAAGACT CATT AAAC GCT CAT T CCT GT G CAAC GTT TT GACA 

2880 

Qy 2886 GATTTTTCTTTCCTCTCTTTCTTTTTCCCCTGACCTTTTCTTCTTTTTGGGTTGAAACTT 

2945 

I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II 

Db 2881 GGTTTTTCTTTCCTCTCTTTCTTTTTCCCCTGACCTT TTCTTTTTGGGTTGAAATTT 

2937 

Qy 2946 GCTGAGGAT TGAAC GAACTT GT CCAAAGAGAT CTTT CTTT AT ATGAAGT CATTAATTAAA 

3005 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 



Db 2938 GCTGAGGATT GAAC GAACTT GT C CAAAGAGAT CTT T CTTTAT AT GAAGT CAT TAAT TAAA 

2997 

Qy 3006 TTTTTTTTTTAAAGACAGGGTCTCATTAAGTAGCCCAAGCTGGCTTCAAACTCAT 

3060 

I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I ! I I I I I I I I I I I 1 I I I 
Db 2998 TTTTTTTTTTTTTTTAAAGACAGGGTCTCATTAAGTAGCCCAAGCTGGCTTCAAACTCAT 

3057 

Qy 3061 GATC CT C CT GC CT CAG C CT C CAAAGT GCT GAGAT T ACAAGT ATATAC CCGTGTCTGGCTC 

3120 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 3058 GATCCTCCTGCCTCAGTCTCCAAAGTGCTGAGATTACAAGTATATATCCGTGTCTGGCTC 

3117 

Qy 3121 AAAATAGCAATT CAAAAACAAAAACTAGTT GGC CAGAT GAAAAGT AGT TTTAC CAAAT T C 

3180 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II II I I I 
Db 3118 AAAAT AGCAATTCAAAAACAAAAACT AGT T GGC CAGAT GAAAAGT AGTT TT AC CAAATT C 

3177 

Qy 3181 ACGTGTTTTTGTTTTTCTGAGAGGCTGCAGCTCAGATGGCCAAAAAGC TGG 

3231 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I III II III 
Db 3178 ACGTGTTTTTGTTTTTCTGAGAGGCTGCAGCTCAGATGGCCTAAAGGCTGGCTTGGGTGG 

3237 

Qy 3232 CAACAGGAGGACCACAGTGGCCTGCCTGCCTAAGGGATAGTAGCCTAGCCATCCTGTGTT 

3291 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I II I I I I 
Db 3238 CAACAGGAGGACCACAGTGGCCTGCCTGCCTAAGGGATAGTAGCCTAGCCATCCTGTGTT 

3297 

Qy 3292 TAT AC C GT G GCAG CAGCAGAAGGCAT AGAACTT AGCTCCAGAT GG CT CT GGAGAGAGAGA 

3351 

I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I II I I I I I II I I I I I I I I I I I II I 
Db 3298 TAT AC C GT GGCAACAGCAGAAGGC AT AGAACT T AGCT C CAGAT GGCT CT GGAGAGAGAGA 

3357 

Qy 3352 AAGGATT CTTAAAGCAGAGTT GAGACAGCAAGAAGCAGGGAATTCGCT GT GTCATGCTGT 

3411 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 3358 AAG GAT T CT T AAAG CAGAGTT GAGACAGCAAGAAGCAGG GAAT T C GCT GT GT CAT GCT GT 

3417 

Qy 3412 TCTGCCGTGGTTAGAACTTAGCTGTTCTGCTGGGAGCTAGGAGCAGGCTTGCCGCCCCCT 

3471 

I II I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I 
Db 3418 TCTGCCGTGGTTAGAACTTAGCTGTTCTGCTGGGAGCTAGGAGCAGGCTTGCCGCCCCCT 

3477 

Qy 3472 GGGAACACGCTCACAAGACGGTTCGTCCCCAAAGGAAACAGTGCCCCCCAAACAGGCTTT 

3531 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I 
Db 3478 GGGAACACGCTCACAAGACGGTTCGTCCCCAAAGGAAACAGTGCCCCCCAAACAGGCTTT 

3537 



Qy 3532 CAGTCCACTCTGTAATCTGCACCTTCCCCTCCAGGATTGAACCAAAGATGCATTTCCGGT 

3591 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 3538 CAGT C CAT GCT GTAAT CT GCAC CTT C C C CT C CAGGATT GAACCAAAGAT GCATT T C C GGT 

3597 

Qy 3592 TTTGTGACTGTGCCACTCTGTGTGTCTCTTGTGGAACCTGGTGTTGTCTGATCCTGTCCG 

3651 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I 
Db 3598 TTTGTGACTGTGCCACTCTGTGTGTCTCTTGTGGAACCTGGTGTTGTCTGATCCTGTCCG 

3657 

Qy 3652 GCTGGCGCTGGATGGAGGACTGTCTCTGTGTGCATCGTGGGCCCTGGTACTTAGCAGAGG 

3711 

I I I I I I i I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 3658 GCTGGCGCTGGATGGAGGACTGTCTCTGTGTGCATCGTGGGCCCTGGTACTTAGCAGAGG 

3717 

Qy 3712 ACAAAGGGTACTGTTGTCAGGAGGGGAAGACTTGGCACGGGCTGGACCACAGTTAGTTTA 

3771 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I 

Db 3718 ACAAAGGGCACTGTTGTCAGGAGGGGAAGACTTGGCACGGGCTGGACCACAGTTAGTTTA 

3777 

Qy 3772 GAAGT T AT GGAACAG CT CAGAAT CT T CT GGT CT T T GACT AT TT CAGAT G GGGT CAGAGAC 

3831 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 3778 GAAGTT AT G GAACAGCT CAGAAT CT TCT GGC CT TT GACT AT TT CAGAT GGGGT CAGAGAC 

3837 

Qy 3832 CAGAGCT GTAGCCAGGAAGCCAGGTTCATCATCTT GGTCCAT CGATT CTAAAGTGGGCAA 

3891 

I I I I I I I I I I I I I I I II II I II I I I I I I I I I I I I I II II I I I I I I I I I I I I I I I I I I II I 
Db 3838 CAGAGCT GTAGCCAGGAAGCCAG GTT CAT CAT CT T GGT C CAT C GATT CTAAAGT GGGCAA 

3897 

Qy 3892 ATTTCTGTGACGTCACAAAGCCGGCCTTTGCCAGTGAGGGCTGAGACACAGTACAACTGC 

3951 

I I I I I I I I I I I I I I I I I I.I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 3898 ATTTCTGTGACGTCACAAAG-CGGCCTTTGCCAGTGAGGGCTGAGACACAGTACAACTGC 

3956 

Qy 3952 CTCTCATTTACTGGTGGCAGGCGGCTTCCTTTGGCCTCTCAGAGCTCTGACTGAACTAGA 

4011 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 3957 CTCTCATTTACTGGTGGCGGGCGGCTTCCTTTGGCCTCTCAGAGCTCTGACTGAACTAGA 

4016 

Qy 4012 AGAGAACACGGATTTGGCTGACCCTGGAAGAAAGCTGCTCTAGTCCTGGCTGAATTTGGT 

4071 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I 

Db 4017 AGAGAACACGGATTTGGCTGACCCTGGAAGAAAGCTGCTCTAGTCCTGGCTGAATTTGGT 

4076 

Qy 4072 AAGACCT GGACTACT TAAACCTTAGGGAG GGACT GACT C CCT CCC GAGGAC CCATTACAG 



4131 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 4077 AAGACCTGGACTACTTAAACTTTAGGGAGGGACTGACTCCCTCCCGAGGACCCATTACAG 

4136 

Qy 4132 GAGGAGGCCAGGCTTTTCTCCCAGAGCTGATGGTGTTCTTCATTCAGCATGGCTTCCGTT 

4191 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 4137 GAGGAGGCCAGGCTTTTCTCCCAGAGCTGATGGTGTTCTTCATTCAGCATGGCTTCCGTT 

4196 

Qy 4192 CAGCTCCCAGGACTTGACACTGAAAATAGAACTCTTTAAGCAGAGAGAAGAGGAGAACCA 

4251 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I III. 
Db 4197 CAGCT CCCAGGACTTGACACTGAAAATAGAACT CTTTAAGCAGAGAGAAGAGGAGAGCCA 

4256 

Qy 4252 TCCACAGACGCTCCCCGTATTTGATGTGACGTGTTTGAGCTTTGACGGGTGAAGAGTCCT 

4311 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 4257 TC CACAGAC GCT CT C C GT ATTT GAT GT GACGT GTTT GAGCT T T GAC GGGT GAAGAGT C CT 

4316 

Qy 4-312 TT TAAAAGATAACT G C CAGCT GCAG GCAT C T GGCT CTG CAAAG CT GGTAG GAT GT GTAC C 

4371 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 4317 TTTAAAAGATAACTGCCAGCTGCAGGCATCTGGCTCTGCAAAGCTGGTAGGATGTGTACC 

4376 

Qy 4372 TGTGTACTGTGCCCGCCCCCTTTCTCCTAGCCCTTTATGTCTTTTTCTGACTGTTTGCTT 

4431 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I 
Db 4377 TGTGTACTGTGCCCGCCCCCTTTCTCCTAGCCCTTTATGTCTTTTTCTGACTGTTTGCTT 

4436 

Qy 4432 TTCTCGTATGTATGTGTGCCTGTGTTGGTGCGAGCCTGTGGAGAAAGAGTCTCCCATCCT 

4491 

I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I II I I I I I I I I 
Db 44 37 TTCTCGTATGTATGTGTGCCTGTGTTGGTGCGAGCCTGTGGAGAAAGAGTCTCCCATCCT 

4496 

Qy 44 92 TCAAATGCTTCGAGAACAGCGTCAGATGTACAACTAGTTTGCCTGCGTTGCTACTGGTAC 

4551 

I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 44 97 TCAAATGCTTCGAGAACAGCGTCAGATGTACAACTAGTTTGCCTGCGTTGCTACTGGTAC 

4556 

Qy 4552 CTT GGACTCT GAACT CAG GT TAC C CACCT GAGT C CT CAGT AGGCAGT GGACC CAT T GAGA 

4611 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I II I I I I I I I I I I I I 

Db 4557 CTTGGACTCTGAACTCAGGTTACCCACCTGAGTCCTCAGTAGGCAGTGGACCCATTGAGA 

4616 

Qy 4612 GG CAAAT GAGAAC AG GAGGGAGACAAGCT GT GTT CT GGG GC GC ACATAAACACCT GACAG 

4671 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 



Db 

4676 



4617 GGCAAATGAGAACAGGAGGGAGACAAGCTGTGTTCTGGGGCGCACATAAACACCTGACAG 



Qy 4 672 AC GAGT CT AGGAAACC GCGT GAAAGAAGAAAT GTTAAATT CT TTAT T GTT TTAT TAT ATT 

4731 

I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 4 677 AC GAGT CTAGGAAACC GCGT GAAAGAAGAAAT GTTAAAT T CT TT ATT GTT TT ATT AT ATT 

4736 

Qy 4732 TATATGGAAAATGTGGCTATCCTTTTGTTAAGTGCAGAGTGTATTGTCTGTTTGACCCAT 

4791 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 4737 TATACGGAAAAT GT GGCTAT CCTTTT GTTAAGT GCAGAGT GTATTATCTGTTT GAC CCAT 
4796 

Qy 4792 GA CTGTCCTTCATGAATGAGTCTTTGCCTGTGATTCTAGTC-AGCCTGTGGCTACT 

4846 

II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I 

Db 4797 GACTGTCTGTCCTTCATGAATGAGTCTTTGCCTGTGATTCTAGTCAAGCCTGTGGCTACT 

4856 

Qy 4847 GAT G GGAAC GGC CGAT CT GT CAT CAT GT GAAGT C CAGGAG GAAGAAT CT ATTTT AGT CAT 

4906 

I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I III I I I I I I I I I I I I I I I I I I II I 

Db 4857 GAT GGGAAC GG C C GAT CT GT CAT CAT GT GAAGT CCAGGAGGAAGAGT CTATTT T AGTC AT 

4916 

Qy 4907 ACGA-TTTGGT CATGAGTAAGGACTATATTTAT GT CACCACTATTGAATATATGTACTTT 

4965 

MM M II M I II M I I I I I M I I I I I II I I II I II II I M II II M II I II I II II II 

Db 4 917 AC GATTTTGGT CAT GAGTAAGGACTATATTTAT GTCACCACTATT GAATATATGTACTTT 

4976 

Qy 4 966 TATAATGGCTGTGAAATACACTTTTTCCTCACAAAAAAAAAAAAAAAAAA 5015 

I I M II I II I I I M II II I I II II II I I I I II II I I II I I I II I I II M I 

Db 4977 TATAATGGCTGTGAAATACACTTTTTCCTCACAAAAAAAAAAAAAAAAAA 5026 
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SUMMARIES 



Description 



1 


2142 


42.6 


2 


2138.8 


42.6 


3 


1750 


34.8 


4 


1714.6 


34.1 


5 


1714.6 


34.1 


6 


484.2 


9.6 


7 


482.2 


9.6 


8 


482.2 


9.6 


9 


482.2 


9.6 


10 


482.2 


9.6 



2142 8 AAL60328 

2142 8 AAL60327 

7385 4 AAH72843 

2145 4 AAF44660 

2145 12 ADI29358 

675 12 ACH87498 

656 4 AAI19670 

656 4 AAI20180 

656 4 AAI20139 

656 4 ABA65209 



Aal60328 Centrosom 
Aal 60327 Centrosom 
Aah72843 Human cer 
Aaf44660 Novel pro 
Adi 29358 Human MAR 
Ach87498 Human gen 
Aai 19670 Probe #96 
Aai 20180 Probe #10 
Aai 20139 Probe #10 
Aba65209 Human foe 



Page 1 



us-10-032-256a-l. rng 





11 


482.2 


9.6 


656 


4 


ABA64696 


Aba64696 


Human foe 




12 


482.2 


9.6 


656 


4 


ABA65168 


Aba65168 


Human foe 




13 


482.2 


9.6 


656 


4 


AAI45339 


Aai 45339 


Probe #14 




14 


482.2 


9.6 


656 


4 


AAI45380 


Aai 45380 


Probe #14 




15 


482.2 


9.6 


656 


4 


AAI 44865 


Aai 44865 


Probe #13 




16 


482.2 


9.6 


656 


4 


ABA46816 


Aba46816 


Human bre 




17 


482.2 


9.6 


656 


4 


ABA47283 


Aba47283 


Human bre 




18 


482.2 


9.6 


656 


4 


ABA47324 


Aba47324 


Human bre 




19 


482.2 


9.6 


656 


4 


ABA31820 


Aba31820 


Probe #10 




20 


482.2 


9.6 


656 


4 


ABA32310 


Aba32310 


Probe #10 




21 


482.2 


9.6 


656 


4 


ABA32270 


Aba32270 


Probe #10 




22 


482.2 


9.6 


656 


4 


AAK38862 


Aak38862 


Human bon 




23 


482.2 


9.6 


656 


4 


AAK39367 


Aak39367 


Human bon 




24 


482.2 


9.6 


656 


4 


AAK39326 


Aak39326 


Human bon 




25 


482.2 


9.6 


656 


4 


AAK13626 


Aakl3626 


Human bra 




26 


482.2 


9.6 


656 


4 


AAK13586 


Aakl3586 


Human bra 




27 


482.2 


9.6 


656 


4 


AAK13131 


Aakl3131 


Human bra 




28 


482.2 


9.6 


656 


4 


ABS38447 


Abs 38447 


Human liv 




29 


482.2 


9.6 


656 


4 


ABS38914 


Abs 38914 


Human liv 




30 


482.2 


9.6 


656 


4 


ABS38955 


Abs38955 


Human liv 




31 


482.2 


9.6 


656 


5 


AAI05884 


Aai 05884 


Probe #58 




32 


482.2 


9.6 


656 


5 


AAI05389 


Aai 05389 


Probe #53 




33 


482.2 


9.6 


656 


5 


AAI05844 


Aai 05844 


Probe #58 




34 


482.2 


9.6 


656 


6 


ABS13454 


Absl3454 


Human gen 




35 


482.2 


9.6 


656 


6 


ABS12943 


Absl2943 


Human gen 




36 


482.2 


9.6 


656 


6 


ABS13413 


Absl3413 


Human gen 


c 


37 


395.8 


7.9 


528 


4 


AAI22604 


Aai 22604 


Probe #12 


c 


38 


395.8 


7.9 


528 


4 


ABA67679 


Aba67679 


Human foe 


c 


39 


395.8 


7.9 


528 


4 


AAI47896 


Aai 47896 


Probe #16 


c 


40 


395.8 


7.9 


528 


4 


ABA49766 


Aba49766 


Human bre 


c 


41 


395.8 


7.9 


528 


4 


ABA34746 


Aba34746 


Probe #13 


c 


42 


395.8 


7.9 


528 


4 


AAK41838 


Aak41838 


Human bon 


c 


43 


395.8 


7.9 


528 


4 


AAK16096 


Aakl6096 


Human bra 


c 


44 


395.8 


7.9 


528 


4 


ABS41434 


Abs41434 


Human liv 


c 


45 


395.8 


7.9 


528 


5 


AAI08280 


Aai 08280 


Probe #82 



ALIGNMENTS 



RESULT 1 




AAL60328 




ID 


AAL60328 standard; DNA; 2142 


BP. 


XX 






AC 


AAL60328; 




XX 






DT 


27-AUG-2003 (first entry) 




XX 






DE 


Centrosome-associated kinase 


Mak V wild-type DNA. 


XX 




KW 


Centrosome-associated kinase; 


cell cycle progression; therapy; 


KW 


cytostatic; Mak V; gene; ds. 




XX 






OS 


Unidentified. 




XX 






PN 


WO2003038078-A2. 




XX 






PD 


08-MAY-2003. 




XX 






PF 


31-OCT-2002; 2002WO-GB004940. 




XX 






PR 


02-NOV-2001; 2001GB-00026415. 




XX 
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PA (UYWA-) UNIV WARWICK. 
XX 

PI Stott D, seung-woon s, Craig GM; 
XX 

DR" WPI; 2003-441358/41. 
XX 

PT New centrosome-associated kinase with decreased or no kinase activity, 

PT useful for identifying inhibitors or activators of cell cycle 

PT progression, for use in manufacturing a treatment for uncontrolled cell 

PT cycle progression. 

XX 

PS Disclosure; Page 16-17; 23pp; English, 
xx 

cc The invention relates to centrosome-associated kinase with decreased or 

cc no kinase activity useful for identifying inhibitors or activators of 

CC cell cycle progression. The invention is useful for identifying an 

CC inhibitor or activator of cell cycle progression, which is useful in 

cc manufacturing a medicament for treating uncontrolled cell cycle 

CC progression. The present sequence is centrosome-associated kinase Mak V 

cc wild-type DNA 
XX 

SQ sequence 2142 BP; 532 A; 620 C; 576 G; 414 T; 0 U; 0 Other; 

Query Match 42.6%; Score 2142; DB 8; Length 2142; 

Best Local Similarity 100.0%; Pred. No. 0; 

Matches 2142; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

ATGCCGGCAGCGGCGGGGGACGGGCTCTTGGGCGAGCCGGCGGCACCGGGGGGCGATGGA 131 

Mill Mill Mill III Mill Mill Mill II III M Mill II Mill I II II III 

ATGCCGGCAGCGGCGGGGGACGGGCTCTTGGGCGAGCCGGCGGCACCGGGGGGCGATGGA 60 
GGCGCGGAGGACACGACCAGGCCGGCGGCGGCCTGCGAGGGAAGTTTCCTGCCCGCCTGG 19 1 

IMMM MIM M 1 1 1 M M 1 1 1 1 M M 1 1 1 1 Ml 1 1 1 M 1 1 1 M 1 1 1 II 1 1 MM M I 

GGCGCGGAGGACACGACCAGGCCGGCGGCGGCCTGCGAGGGAAGTTTCCTGCCCGCCTGG 12 0 

GTGAGCGGCGTGTCCCGCGAGCGGCTCCGGGACTTCCAGCACCACAAGCGCGTGGGCAAC 251 

II I II II II I II II I II II I I II I II II II II I II II I II II II I II II II I I II II II I 
GTGAGCGGCGTGTCCCGCGAGCGGCTCCGGGACTTCCAGCACCACAAGCGCGTGGGCAAC 180 

TACCTCATCGGCAGCAGGAAGCTGGGAGAGGGCTCCTTCGCCAAGGTGCGCGAGGGGCTG 311 

II II IMMM MM II II II III I Mill Mill II Ml II III III MM I MM II I 

TACCTCATCGGCAGCAGGAAGCTGGGAGAGGGCTCCTTCGCCAAGGTGCGCGAGGGGCTG 2 40 

CACGTGCTGACGGGAGAAAAGGTAGCTATCAAGGTCATCGATAAGAAAAGAGCCAAGAAA 371 

I II I II I I II I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I II II I I I I I I I I I I I I I I 
CACGTGCTGACGGGAGAAAAGGTAGCTATCAAGGTCATCGATAAGAAAAGAGCCAAGAAA 300 

GACACCTACGTCACCAAAAACCTGCGTCGAGAGGGGCAGATCCAGCAGATGATCCGACAC 431 

II I I I I II I II I II I I I I I I I II II I II II I I I I I I I I I II II I I I I I I I I I I II II II I 
GACACCTACGTCACCAAAAACCTGCGTCGAGAGGGGCAGATCCAGCAGATGATCCGACAC 3 60 

CCCAACATCACACAGCTCCTGGACATCTTGGAGACAGAGAACAGCTACTACCTGGTCATG 491 

M II II III II 1 1 1 1 M II MM 1 1 MM 1 1 1 1 III I Mill Ml III III 1 1 MM II I 

CCCAACATCACACAGCTCCTGGACATCTTGGAGACAGAGAACAGCTACTACCTGGTCATG 420 
GAGCTGTGTCCTGGTGGCAACCTCATGCACAAGATCTACGAAAAGAAACGGTTGGATGAA 551 

M Ml I III II 1 1 I I II I I I MM I I MIMI I Ml I M III I II II I 1 1 IMMM III 

GAGCTGTGTCCTGGTGGCAACCTCATGCACAAGATCTACGAAAAGAAACGGTTGGATGAA 480 
GCCGAGGCCCGCAGATACATCCGGCAACTCATCTCTGCGGTGGAACACCTGCACCGTGCG 6 11 

1 1 II II III MM 1 1 M M 1 1 M I II 1 1 II II 1 1 1 1 1 II 1 1 M II II MIMI MM II I 

GCCGAGGCCCGCAGATACATCCGGCAACTCATCTCTGCGGTGGAACACCTGCACCGTGCG 5 40 



Qy 


72 


Db 


1 


Qy 


132 


Db 


61 


Qy 


192 


Db 


121 


Qy 


252 


Db 


181 


Qy 


312 


Db 


241 


Qy 


372 


Db 


301 


Qy 


432 


Db 


361 


Qy 


492 


Db 


421 


Qy 


552 


Db 


481 
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Qy 


612 


Db 


541 


Qy 


672 


Db 


601 


Qy 


732 


Db 


661 


Qy 


792 


Db 


721 


Qy 


852 


Db 


781 


Qy 


912 


Db 


841 


Qy 


972 


Db 


901 


Qy 


1032 


Db 


961 


Qy 


1092 


Db 


1021 


Qy 


1152 


Db 


1081 


Qy 


1212 


Db 


1141 


Qy 


1272 


Db 


1201 


Qy 


1332 


Db 


1261 


Qy 


1392 


Db 


1321 


Qy 


1452 


Db 


1381 


Qy 


1512 


Db 


1441 
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GGGGTGGTTCACAGAGACTTGAAGATAGAGAATTTGCTACTAGATGAAGACAATAATATC 

I I II 1 1 MM II I I I I 1 1 I II II II II II II I II I Ml 1 1 I M II I I I I III II I MM I 

GGGGTGGTTCACAGAGACTTGAAGATAGAGAATTTGCTACTAGATGAAGACAATAATATC 



671 
600 
731 



AAGCTGATTGACTTTGGCTTGAGCAACTGTGCAGGGATCCTAGGTTACTCGGATCCATTC 

I II I III III I II I II I I I II I Mill II I II III III II II I I Ml I I II I Ml II I II 

AAGCTGATTGACTTTGGCTTGAGCAACTGTGCAGGGATCCTAGGTTACTCGGATCCATTC 660 



AGCACACAGTGTGGCAGCCCTGCCTATGCTGCGCCAGAACTGCTTGCCAGGAAGAAATAT 

I I I I I I I II I I M I Mill M II I I I I II III I II I I I I I I M II I I I I II I II I Mill 

AGCACACAGTGTGGCAGCCCTGCCTATGCTGCGCCAGAACTGCTTGCCAGGAAGAAATAT 

GGCCCCAAAATTGATGTCTGGTCAATAGGCGTGAACATGTATGCCATGCTGACGGGGACC 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I ! I I I I I I I 1 I I I I I I I I I II I I I I I 
GGCCCCAAAATTGATGTCTGGTCAATAGGCGTGAACATGTATGCCATGCTGACGGGGACC 

CTACCTTTCACTGTGGAGCCTTTCAGCCTGAGGGCTCTGTATCAGAAGATGGTGGACAAA 

II I I MM II II I M I I MM IMMM II I III II Ml II I I II I II I I II I I II III 

CTACCTTTCACTGTGGAGCCTTTCAGCCTGAGGGCTCTGTATCAGAAGATGGTGGACAAA 



791 
720 
851 
780 
911 
840 
971 



GCAATGAATCCCCTGCCGACCCAGCTCTCCACAGGGGCCGTCAACTTTCTGCGCTCCCTC 

II I II 1 1 II 1 1 M I MM I II II II I II II III II MM Ml I Ml I II 1 1 Ml I II 1 1 1 

GCAATGAATCCCCTGCCGACCCAGCTCTCCACAGGGGCCGTCAACTTTCTGCGCTCCCTC 900 
CTGGAACCAGACCCTGTGAAGAGGCCGAATATCCAGCAAGCGCTGGCGAATCGCTGGTTG 10 3 1 

II MM II II II II MM Ml III II MMM M MM MM II M II II III I II II I 

CTGGAACCAGACCCTGTGAAGAGGCCGAATATCCAGCAAGCGCTGGCGAATCGCTGGTTG 960 
AATGAGAATTACACTGGAAAGGTGCCCTGCAATGTCACCTATCCCAACAGGATTTCTTTG 1091 

I III MMM I 1 1 1 1 1 M II I Ml MMM M I Ml II II I M 1 1 M 1 1 1 1 MM Ml M 

AATGAGAATTACACTGGAAAGGTGCCCTGCAATGTCACCTATCCCAACAGGATTTCTTTG 102 0 
GAAGACCTGAGTCCCAGCGTGGTGCTGCACATGACTGAAAAGCTGGGCTATAAGAACAGT 115 1 

I M II II MM I M I II II II MMMMI M III MM II Ml II I M Mill II II II 

GAAGACCTGAGTCCCAGCGTGGTGCTGCACATGACTGAAAAGCTGGGCTATAAGAACAGT 10 80 
GACGTCATCAACACGGTGCTCTCCAACCGCGCCTGCCACATCCTGGCCATCTACTTCCTG 12 11 

M II I M II M M I I II I II I M I MMMMI II I II II III M I I II II MM II I II 

GACGTCATCAACACGGTGCTCTCCAACCGCGCCTGCCACATCCTGGCCATCTACTTCCTG 1140 
TTGAACAAGAAACTTGAGCGCTATTTGTCAGGGAAATCAGATATCCAAGATAGCATCTGC 1271 

II II I II II I M II II II II II MMM II III II III I II I I I II I I III MM MM I 

TTGAACAAGAAACTTGAGCGCTATTTGTCAGGGAAATCAGATATCCAAGATAGCATCTGC 1200 
TACAAGACCCAGCTCTACCAGATAGAGAAGTGCAGAGCCACCAAGGAGCCCTATGAGGCC 1331 

MM II II II M I I II I Ml MM MM II II II I I MM II II I I II II MM III I I 

TACAAGACCCAGCTCTACCAGATAGAGAAGTGCAGAGCCACCAAGGAGCCCTATGAGGCC 12 60 
TCCCTGGATACCTGGACGAGGGACTTTGAATTCCATGCTGTGCAGGATAAAAAGCCCAAA 1391 

I II 1 1 Mill I II I II II 1 1 II II 1 1 II II 1 1 III I II I III 1 1 II I II I II II I II I II _ 

TCCCTGGATACCTGGACGAGGGACTTTGAATTCCATGCTGTGCAGGATAAAAAGCCCAAA 1320 
GAACAAGAAAAAAGAGGTGATTTTCTCCACCGTCCGTTTTCCAAGAAGTTGGACAAGAAC 14 5 1 

II II I MM I I III III M I I I MMMM M I II Ml I II M II I M M I I II I M I M 

GAACAAGAAAAAAGAGGTGATTTTCTCCACCGTCCGTTTTCCAAGAAGTTGGACAAGAAC 1380 
CTGCCTTCTCACAAACAGCCATCGCCCTCGCTGATCACACAGCTCCAGAGTACCAAAGCC 1511 

II Ml II II MM I II II 1 1 II MIIIMI Ml II I II I II I I II I II II I MM II II I 

CTGCCTTCTCACAAACAGCCATCGCCCTCGCTGATCACACAGCTCCAGAGTACCAAAGCC 1440 
CTGCTCAAAGACAGGAAGGCCTCCAAGTCAGGCTTCCCCGACAAAGATTCCTTCGTCTGC 1571 

M MM I II II M I I I I I Ml I MMMM M I II Ml I Ml I Ml II II I Mill I I II 

CTGCTCAAAGACAGGAAGGCCTCCAAGTCAGGCTTCCCCGACAAAGATTCCTTCGTCTGC 1500 
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us-10-032-256a-l. rng 



Qy 


1572 


CGCAATCTTTTCCGAAAAACCTCTGATTCCAATTGTGTGGCTTCTTCTTCCATGGAATTC 


1631 




1 1 1! 1! 1 Mill 1 1 1 II II 1 II Mill Ml 1 II IN Mill Ml III 1 1! II 1 MMM 1 


1I>00 


Db 


1501 


CGCAATc MM CCGAAAAACCTCTGATTCCAATTGTGTGGCTTCTTCTTCCATGGAATTC 


Qy 


1632 


ATCCCTGTCCCACCTCCCAGGACACCAAGGATTGTAAAGAAACTAGAGCCACACCAACCA 


1691 




1 1 II 1 II M 1 1 Ml 1 II Mill MMMM III 1 II II 1 MM MM II 1 III 1 1 1 1 1 1 


1620 


Db 


1561 


ATCCCTGTCCCACCTCCCAGGACACCAAGGATTGTAAAGAAACTAGAGCCACACCAACCA 


Qy 


1692 


GGGCCGGGAAGTGCCAGCATCCTCCCCAAGGAAGAGCCCCTGCTGCTGGATATGGTACGC 


1751 




1 1 MM 1 II 1 1 1 1 1 1 II 1 1 1 M MMMM 1 II II 1 MM 1 1 II II 1 1 Mill 1 II Ml 1 




Db 


1621 


GGGCCGGGAAGTGCCAGCATCCTCCCCAAGGAAGAGCCCCTGCTGCTGGATATGGTACGC 


1680 


Qy 


1752 


TCCTTTGAGTCTGTGGATCGAGAGGACCACATAGAACTGCTGTCCCCTTCTCACCATTAT 


1811 




MM 1 II II 1 M 1 M II M M 1 II 1 II 1 M 1 M II M M M 1 M M M M M M 1 M M 1 




Db 


1681 


TCCTTTGAGTCTGTGGATCGAGAGGACCACATAGAACTGCTGTCCCCTTCTCACCATTAT 


1740 


Qy 


1812 


AGGATCCTGAGCTCGCCTGTGAGCCTGGCTCGTAGGAATTCTAGTGAGAGGACACTCTCC 


1871 




1 1 1 M II Ml 1 1 1 1 1 M 1 1 1 1 1 III Mill 1 II III 1 II 1 1 1 1 1 1 1 1 1 1 MM II 1 M 1 1 




Db 


1741 


AGGATCCTGAGCTCGCCTGTGAGCCTGGCTCGTAGGAATTCTAGTGAGAGGACACTCTCC 


1800 


Qy 


1872 


CAGGGGCTGCTGTCCGGAAGTACCTCACCTCTCCAAACTCCACTGCATTCCACGCTGGTC 


1931 




II 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I 1 1 M 1 II 1 1 M 1 1 M 1 1 1 1 1 1 II 1 1 1 1 




Db 


1801 


CAGGGGCTGCTGTCCGGAAGTACCTCACCTCTCCAAACTCCACTGCATTCCACGCTGGTC 


1860 


Qy 


1932 


TCTTTTGCCCACGAAGAAAAGAACAGCCCCCCGAAAGAGGAGGGTGTGTGTTCACCGCCT 


1991 




1 1 1 II 1 1 III 1 II 1 1 1 1 1 II II III Mill 1 1 1 1 1 1 1 1 1 II II III 1 1 1 1 Ml 1 1 1 II 1 1 




_ 1 

Db 


1861 


TC M M GCCCACGAAGAAAAGAACAGCCCCCCGAAAGAGGAGGGTGTGTGTTCACCGCCT 


1920 


Qy 


1992 


CCCGTTCCCAGTAATGGCCTCCTGCAGCCTCTGGGGAGCCCCAACTGTGTGAAGAGCAGG 


2051 




1 1 1 II 1 Ml II 1 M 1 1 1 II II MMM 1 1 1 II Mill 1 1 II 1 1 1 1 1 1 II 1 1 III M 1 1 1 




Db 


1921 


CCCGTTCCCAGTAATGGCCTCCTGCAGCCTCTGGGGAGCCCCAACTGTGTGAAGAGCAGG 


1980 


Qy 


2052 


GGACGGTTCCCCATGATGGGCATCGGACAGATGCTGAGGAAGCGGCACCAGAGCCTGCAG 


2111 




1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M III 1 1 1 1 1 1 1 II II 1 II 1 1 1 Ml 1 M II II 1 II 1 1 1 II II 1 




Db 


1981 


GGACGGTTCCCCATGATGGGCATCGGACAGATGCTGAGGAAGCGGCACCAGAGCCTGCAG 


2040 


Qy 


2112 


CCTTCCTCAGAGAGGTCCCTGGACGCCAGCATGTCCCCTCTGCAGCCCATAGCCCCCTCC 


2171 




1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


2041 


CCTTCCTCAGAGAGGTCCCTGGACGCCAGCATGTCCCCTCTGCAGCCCATAGCCCCCTCC 


2100 


Qy 


2172 


AGCCTCTCCTTTGACATGGCCGACGGTGTCAAGGGCCAGTGT 2213 






MM III MINI 1 1 1 llllll MMIIIII lllllllllll 




Db 


2101 


AGCCTCTCCTTTGACATGGCCGACGGTGTCAAGGGCCAGTGT 2 142 

• 
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us-10-032-256a-l. rnpb 



GenCore version 5.1.6 
Copyright (c) 1993 - 2005 Compugen Ltd. 



OM nucleic - nucleic search, using sw model 
Run on: 



Title: 

Perfect score: 
Sequence: 

Scoring table: 
Searched: 



April 20, 2005, 05:51:28 



US-10-032-256A-1 
5024 

1 gcaggaggagccagggcagc. 



Search time 2639 Seconds 

(without alignments) 

11549.510 Million cell updates/sec 



. aaaaaaaaaaaaaaaaaaaa 5024 



IDENTITY_NUC 

Gapop 10.0 , Gapext 1.0 

5622541 seqs, 3033355566 residues 



Total number of hits satisfying chosen parameters: 



11245082 



Minimum DB seq length: 
Maximum DB seq length: 



2000000000 



Post-processing: 



Minimum Match 0% 
Maximum Match 100% 
Listing first 45 summaries 

Database : Published^Applications_NA:* 

1 : /cgn2_6/ptodata/l/pubpna/US07_PUBCOMB . seq : * 

2 : /cgn2_6/ptodata/l/pubpna/PCT_NEW_PUB . seq : * 

3 : /cgn2_6/ptodata/l/pubpna/us06_NEW_PUB . seq : * 

4 : /cgn2_6/ptodata/l/pubpna/US06_PUBCOMB . seq : * 

5 : /cgn2_6/ptodata/l/pubpna/US07_NEW_PUB . seq : * 

6 : /cgn2_6/ptodata/l/pubpna/PCTUS_PUBCOMB . seq : * 

7 : /cgn2_6/ptodata/l/pubpna/US08_NEW_PUB . seq : * 

8 : /cgn2_6/ptodata/l/pubpna/US08_PUBCOMB . seq : * 

9 : /cgn2_6/ptodata/l/pubpna/US09A_PUBCOMB . seq : * 
10 : /cgn2_6/ptodata/l/pubpna/US09B_PUBCOMB . seq : * 
11 : /cgn2_6/ptodata/l/pubpna/US09C_PUBCOMB . seq : * 
12 : /cgn2_6/ptodata/l/pubpna/US09_NEW_PUB . seq : * 
13 : /cgn2_6/ptodata/l/pubpna/USlO^PUBCOMB . seq : * 
14: /cgn2_6/ptodata/l/pubpna/USlOB_PUBCOMB.seq:* 
15 : /cgn2_6/ptodata/l/pubpna/USlOC_PUBCOMB . seq : * 
16 : /cgn2_6/ptodata/l/pubpna/USlOD_PUBCOMB . seq : * 
17 : /cgn2_6/ptodata/l/pubpna/USl0E_PUBCOMB . seq : * 
18 : /cgn2_6/ptodata/l/pubpna/USl0F_PUBCOMB . seq : * 
19 : /cgn2_6/ptodata/l/pubpna/USl0_NEW_PUB . seq : * 
20 : /cgn2_6/ptodata/l/pubpna/USll_NEW_PUB . seq : * 
21: /cgn2_6/ptodata/l/pubpna/US60_NEW_PUB . seq : * 
22 : /cgn2_6/ptodata/l/pubpna/US60_PUBCOMB . seq : * 

Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 



% 

Result Query 

No. Score Match Length DB ID 



SUMMARIES 



Description 

Sequence 20693, A 



484.2 9.6 675 16 US-10-029-386-20693 
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2 


482.2 


9.6 


656 


9 




3 


482.2 


9.6 


656 


9 




4 


482.2 


9.6 


656 


9 


c 


5 


395.8 


7.9 


528 


9 




6 


377.2 


7.5 


482 


9 




7 


377.2 


7.5 


482 


11 




8 


377.2 


7.5 


482 


11 




9 


377.2 


7.5 


482 


17 


c 


10 


345 


6.9 


474 


9 




11 


338.8 


6.7 


470 


9 




12 


272.6 


5.4 


459 


9 




13 


272.6 


5.4 


459 


9 




14 


229 


4.6 


297 


9 




15 


229 


4.6 


297 


9 




16 


194.4 


3.9 


457 


9 




17 


180 


3.6 


502 


16 




18 


174.8 


3.5 


1205 


18 


c 


19 


174.8 


3.5 


1205 


18 




20 


174.8 


3.5 


1205 


19 


c 


21 


174.8 


3.5 


1205 


19 




22 


174.8 


3.5 


56153 


17 




23 


171.2 


3.4 


472 


9 




24 


171.2 


3.4 


472 


9 




25 


155 


3.1 


1975 


17 




26 


155 


3.1 


1975 


17 




27 


148.8 


3.0 


2110 


18 




28 


148.6 


3.0 


1317 


17 




29 


148.6 


3.0 


2022 


17 




30 


148.2 


2.9 


2150 


18 


c 


31 


146.8 


2.9 


466 


9 




32 


146.6 


2.9 


1770 


17 




33 


145.4 


2.9 


474 


9 




34 


143.8 


2.9 


2073 


17 




35 


143.6 


2.9 


2138 


17 




36 


143.6 


2.9 


2922 


18 




37 


143.4 


2.9 


1933 


17 




38 


143.4 


2.9 


1945 


17 




39 


143.4 


2.9 


2124 


18 




40 


143 


2.8 


237 


9 




41 


143 


2.8 


249 


9 


c 


42 


142.8 


2.8 


189 


9 




43 


142.4 


2.8 


2044 


18 




44 


141.8 


2.8 


453 


9 




45 


141.6 


2.8 


1812 


17 



us-10-032-256a-l. rnpb 
US-09-864-761-17140 
US-09-864-761-17590 
US-09-864-761-17630 
US-09-864-761-20066 
US-09-764-868-224 
US-09-764-875-72 
US-09-764-875-389 
US-10-242-355-186 
US-09-864-761-3290 
US-09-864-761-326 
US-09-864-761-807 
US-09-864-761-849 
US-09-864-761-17591 
US-09-864-761-17631 
US-09-864-761-325 
US-10-029-386-6982 
US-10-363-345A-19069 
US-10-363-345A-19070 
US-10-363-483A-19069 
US-10-363-483A-19070 
US-10-221-714A-519 
US-09-864-761-808 
US-09-864-761-850 
US-10-424-599-3569 
US-10-425-114-8714 
US-10-739-930-5028 
US-10-425-114-9772 
US-10-425-114-6306 
US-10-425-115-101612 
US-09-864-761-426 

US-10-425-114-35453 
US-09-864-761-10591 
US-10-424-599-3567 
US-10-425-114-2343 
US-10-425-115-128036 
US-10-425-114-3580 
US-10-425-114-28167 
US-10-425-115-28948 
US-09-864-761-27212 
US-09-864-761-20252 
US-09-864-761- 17225 

US-10-425-115-154128 
US-09-864-761-3482 
US-10-425-114-29601 



Sequence 17140, A 
Sequence 17590, A 
Sequence 17630, A 
Sequence 20066, A 
Sequence 224, App 
Sequence 72, Appl 
Sequence 389, App 
Sequence 186, App 
Sequence 3290, Ap 
Sequence 326, App 
Sequence 807, App 
Sequence 849, App 
Sequence 17591, A 
Sequence 17631, A 
Sequence 325, App 
Sequence 6982, Ap 
Sequence 19069, A 
Sequence 19070, A 
sequence 19069, A 
Sequence 19070, A 
Sequence 519, App 
Sequence 808, App 
Sequence 850, App 
Sequence 3569, Ap 
Sequence 8714, Ap 
Sequence 5028, Ap 
Sequence 9772, Ap 
Sequence 6306, Ap 
Sequence 101612, 
Sequence 426, App 

Sequence 35453, A 
Sequence 10591, A 
Sequence 3567, Ap 
Sequence 2343, Ap 
Sequence 128036, 
Sequence 3580, Ap 
Sequence 28167, A 
Sequence 28948, A 
Sequence 27212, A 
Sequence 20252, A 
Sequence 17225, A 
Sequence 154128, 
Sequence 3482, Ap 
Sequence 29601, A 



ALIGNMENTS 



RESULT 1 

US-10-029-386-20693 

; Sequence 20693, Application US/10029386 

; Publication No. US20030194704A1 

; GENERAL INFORMATION: 

; APPLICANT: Penn, sharron G. 

; APPLICANT: Rank, David R. 

; APPLICANT: Hanzel, David K. 

; TITLE OF INVENTION: HUMAN GENOME-DERIVED SINGLE EXON NUCLEIC ACID PROBES USEFUL 
FOR GENE 

; TITLE OF INVENTION: EXPRESSION ANALYSIS TWO 
; FILE REFERENCE: AEOMICA-X-2 
; CURRENT APPLICATION NUMBER: US/10/029,386 
; CURRENT FILING DATE: 2001-12-20 
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us-10-032-256a-l. rnpb 

; NUMBER OF SEQ ID NOS: 34288 

; SOFTWARE : Annomax Sequence Listing Engine vers. 1.1 

; SEQ ID NO 20693 
; LENGTH: 675 
; TYPE: DNA 

; ORGANISM: Homo sapiens 
; FEATURE: 

; OTHER INFORMATION: MAP TO AP001712.1 

; OTHER INFORMATION: EXPRESSED IN HEART, SIGNAL =2.8 

; OTHER INFORMATION: EXPRESSED IN BONE MARROW, SIGNAL =2.8 

; OTHER INFORMATION: EXPRESSED IN FETAL LIVER, SIGNAL =2.3 

; OTHER INFORMATION: EXPRESSED IN ADULT LIVER, SIGNAL =2.5 

; OTHER INFORMATION: EXPRESSED IN PLACENTA, SIGNAL =3.1 

; OTHER INFORMATION: EXPRESSED IN BRAIN, SIGNAL =2.7 

; OTHER INFORMATION: EXPRESSED IN HELA, SIGNAL =1.8 

; OTHER INFORMATION: NT HIT: gi 14780177, EVALUE O.OOe+00 

; OTHER INFORMATION: SWISSPROT HIT: P57058, EVALUE 1.00e-109 

; OTHER INFORMATION: EST_HUMAN HIT: BF529471.1, EVALUE O.OOe+00 

US-10-029-386-20693 



Query Match 9.6%; score 484.2; DB 16; Length 675; 

Best Local Similarity 83.6%; Pred. No. 3.9e-127; 

Matches 549; Conservative 0; Mismatches 108; Indels 0; Gaps 0; 



Qy 




AGATTCCTTCGTCTGCCGCAATC III! CCG AAA A ACCTCTGATTCC A ATTGTGTGGCTTC 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 
1 1 1 ! 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 t 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


18 


AGATTCCTTTGGCTGCCGCAATA 1 1 1 1 CCGCAAAACCTCAGATTCCAATTGTGTGGCTTC 


77 


Qy 


1616 


TTCTTCCATGGAATTCATCCCTGTCCCACCTCCCAGGACACCAAGGATTGTAAAGAAACT 


1675 




MINIMUM 1 1 1 1 1 1 1 1 II Mill 1 1 1 1 1 1 1 1 II MINIM lllllll 




Db 


78 


TTCTTCCATGGAGTTCATCCCCGTGCCACCGCCCAGGACCCCGAGGATTGTGAAGAAACC 


137 


Qy 


1676 


AGAGCCACACCAACCAGGGCCGGGAAGTGCCAGCATCCTCCCCAAGGAAGAGCCCCTGCT 


1735 




Mill II II llllllll Mill 1 MUM II MIIIIMI llllll 1 




Db 


138 


GGAGCCCCATCAGCCAGGGCCCGGAAGCACTGGCATCCCCCACAAGGAAGACCCCCTGAT 


197 


Qy 


1736 


GCTGGATATGGTACGCTCCTTTGAGTCTGTGGATCGAGAGGACCACATAGAACTGCTGTC 


1795 




llllll Mill llllllll 1 lllllll II llllll Mill lllllll 




Db 


198 


GCTGGACATGGTGCGCTCCTTCGAGTCTGTGGATCGCGACGACCACGTAGAAGTGCTGTC 


257 


Qy 


1796 


CCCTTCTCACCATTATAGGATCCTGAGCTCGCCTGTGAGCCTGGCTCGTAGGAATTCTAG 


1855 




II Mill II II Mill MM III II II III lllllll II Mill II 


317 


Db 


258 


TCCCTCTCATCACTACAGGATTCTGAACTCCCCGGTCAGCTTGGCTCGCAGAAATTCCAG 


Qy 


1856 


TGAGAGGACACTCTCCCAGGGGCTGCTGTCCGGAAGTACCTCACCTCTCCAAACTCCACT 


1915 




llllllll II MM III MM llllllll 1 II llllllll Mill 1 




Db 


318 


CGAGAGGACGCTGTCCCCGGGTCTGCCATCCGGAAGCATGTCGCCTCTCCATACTCCTTT 


377 


Qy 


1916 


GCATTCCACGCTGGTCTCTTTTGCCCACGAAGAAAAGAACAGCCCCCCGAAAGAGGAGGG 


1975 




MM 1 II MMMMIIMM llllllll M 1 1 1 1 1 1 1 1 1 1 1 1 MIIMIMM 




Db 


378 


GCATCCAACTCTGGTCTCTTTTGCTCACGAAGATAAGAACAGCCCCCCAAAAGAGGAGGG 


437 


Qy 


1976 


TGTGTGTTCACCGCCTCCCGTTCCCAGTAATGGCCTCCTGCAGCCTCTGGGGAGCCCCAA 


2035 




llllll II Mill llllllll lllllll 1 1 M 1 M 1 1 1 1 1 II M II II M 




Db 


438 


CCTGTGTTGCCCACCTCCGGTTCCCAGCAATGGCCCCATGCAGCCTCTGGGGAGCCCCAA 


497 


Qy 


2036 


CTGTGTGAAGAGCAGGGGACGGTTCCCCATGATGGGCATCGGACAGATGCTGAGGAAGCG 


2095 




llllllll III 1 II llllllll IIIIIIIIIIIIIIIIMIII 1 llllllll 




Db 


498 


TTGTGTGAAAAGCCGAGGCCGGTTCCCTATGATGGGCATCGGACAGATGTTAAGGAAGCG 


557 


Qy 


2096 


GCACCAGAGCCTGCAGCCTTCCTCAGAGAGGTCCCTGGACGCCAGCATGTCCCCTCTGCA 


2155 


Db 


558 


II Mill llllllll II MM III lllllll llllll II MM Mill 

CCATCAGAGTCTGCAGCCATCTGCAGATAGGCCCCTGGAGGCCAGCCTGCCCCCACTGCA 


617 
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2156 GCCCATAGCCCCCTCCAGCCTCTCCTTTGACATGGCCGACGGTGTCAAGGGCCAGTG 2212 

1 1 1 1 1 1 1 1 1 1 1 I III 1 1 1 1 1 II 1 1 1 I I II I I II Mill MINI 

618 GCCCCTAGCCCCTGTGAACCTTGCCTTTGACATGGCCGATGGGGTCAAGACCCAGTG 674 
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GenCore version 5.1.6 
Copyright (c) 1993 - 2005 Compugen Ltd. 



OM nucleic - nucleic search, using sw model 
Run on: 



Title: 

Perfect score: 
Sequence : 

Scoring table: 



Searched: 



April 19, 2005, 23:17:42 ; Search time 14801 Seconds 

(without alignments) 
12920.403 Million cell updates/sec 

US-10-032-256A-1 
5024 

1 gcaggaggagccagggcagc aaaaaaaaaaaaaaaaaaaa 5024 

IDENTITY_NUC 

Gapop 10.0 , Gapext 1.0 



68479088 



34239544 seqs, 19032134700 residues 

Total number of hits satisfying chosen parameters: 

Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 

Post-processing: Minimum Match 0% 

Maximum Match 100% 
Listing first 45 summaries 

Database : EST:* 

1: gb_estl:* 
2: gb_est2:* 
3 : gb_htc : * 
4: gb_est3:* 
5: gb_est4:* 
6: gb_est5:* 
7: gb_est6:* 
8: gb_gssl:* 
9: gb_gss2:* 

Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 



SUMMARIES 



Result 



% 

Query 



Mo. 


Score 


Match 


Length 


DB 


ID 


Description 


1 


785.2 


15.6 


791 


5 


BU056886 


BU056886 UI-M-FO0- 


2 


783.4 


15.6 


851 


6 


CB845007 


CB845007 M2PN-0573 


3 


781 


15.5 


852 


5 


BQ180567 


BQ180567 UI-M-EX0- 


4 


779.4 


15.5 


864 


6 


CA325070 


CA325070 UI-M-FY0- 


5 


770 


15.3 


771 


6 


CB249687 


CB249687 UI-M-EX0- 


6 


769.4 


15.3 


775 


6 


CA316433 


CA316433 UI-M-FW0- 


7 


759 


15.1 


810 


6 


CB518393 


CB518393 UI-M-GH0- 


8 


753.2 


15.0 


805 


7 


CF729648 


CF729648 UI-M-HD0- 



9 


748 


14 . 


9 


760 


7 


CK635586 


PK6355ft 6 

\-> l\ U J J J(J VJ 


TjT— M— HMO — 


10 


737 


14 . 


7 


768 


5 


BO770004 


RD77 0 004 

U^y/ / / VJ VJ VJ 1 


TJT -M— RT 0 — 

VJ X 1 v l E X U 


11 


732 . 8 


14 . 


6 


736 


6 


CB245694 
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ALIGNMENTS 



RESULT 1 
BU056886 
LOCUS 

DEFINITION 

ACCESSION 
VERSION 
KEYWORDS 
SOURCE 

ORGANISM 



REFERENCE 
AUTHORS 
TITLE 



BU056886 

UI-M-FO0-bzz-h-14-0-UI , 
, mRNA 



791 bp 
rl NIH_BMAP_ 
sequence. 



mRNA 
FOO Mus 



linear 
musculus 



EST 2 6-AUG-2002 
cDNA clone 



IMAGE: 6412861 I 
BU056886 

BU056886.1 GI:22496963 
EST. 

Mus musculus (house mouse) 
Mus musculus 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; 
Mammalia; Eutheria; Rodentia; Sciurognathi ; Muridae; 
1 (bases 1 to 791) 
NIH-MGC http://mgc.nci.nih.gov/. 

National Institutes of Health, Mammalian Gene Collection (MGC) 



Euteleostomi; 
Murinae; Mus. 



JOURNAL Unpublished (1999) 
COMMENT Contact: Robert Strausberg, Ph.D. 

Email : cgapbs-r@mail . nih . gov 

Tissue Procurement: Dr. Jim Lin, University of Iowa 
cDNA Library preparation: Dr. M. Bento Soares, University of Iowa 
cDNA Library Arrayed by: Dr. M. Bento Soares, University of Iowa 
DNA Sequencing by: Dr. M. Bento Soares, University of Iowa 
Clone Distribution: MGC clone distribution information can be 

found through the I.M.A.G.E. Consortium/LLNL at: 

http : / / image . llnl . gov 
This clone was contributed by the Brain Molecular Anatomy Project 

(BMAP) 

Seq primer: pYX-5. 
FEATURES Location/Qualifiers 
source 1. .791 

/organism="Mus musculus" 
/mol_type="mRNA" 
/strain="C57BL/6" 
/db_xref="taxon: 10090" 
/clone=" IMAGE: 6412861" 
/tissue_type="whole brain" 
/dev_stage="embryo 12.5dpc" 
/lab_host="DH10B (Tl phage resistant) " 
/ cl o ne_l i b= " N I H_BMAP_FO 0 " 

/note="Organ: Brain; Vector: pYX- Asc; Site_l : EcoR I; 
Site_2: Not I; The library was constructed according 
Bonaldo, Lennon and Soares, Genome Research, 6:791-806, 
1996. Denatured RNA was size fractionated on a 1% agarose 
gel. First strand cDNA synthesis was primed with oligo-dT 
primer containing a Not I site. Double strand cDNA was 
size selected according to mRNA size fraction, ligated 
with EcoR I adaptor, digested with NotI and then cloned 
directionally into pYX-Asc vector. The library tag 
sequence located between the Not I site arid the polyA tail 
is TGAGAGAGCC. This library was created for the University 
Iowa Brain Anatomy Project (BMAP) : 'Gene Discovery in the 
Developing Mouse Nervous System* , supported by National 
Institute of Mental Health (NIMH), Hemin Chin, Ph.D., 
program coordinator." 

ORIGIN 

Query Match 15.6%; Score 785.2; DB 5; Length 791; 

Best Local Similarity 99.5%; Pred. No. 1.3e-176; 

Matches 787; Conservative 0; Mismatches 4; Indels 0; Gaps 0; 



Qy 509 CAACCT CAT G CACAAGAT CTAC GAAAAGAAAC GGTT GGAT GAAGC CGAGG CC CGCAGATA 568 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 1 I I I I I I I I I I I I I I II I I I I I I 
Db 1 CAAC CT CAT G CACAAGAT CTAC GAAAAGAAAC GGTT GGAT GAAG C CGAGGC C CGCAGATA 60 

Qy 569 CATCCGGCAACTCATCTCTGCGGTGGAACACCTGCACCGTGCGGGGGTGGTTCACAGAGA 62 8 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 61 CAT C C GGCAACT CATCT CT GC G GT G GAACAC CT GCACCGT GC GGG GGT GGTT CACAGAGA 120 

Qy 629 CTT GAAGAT AGAGAATT T GCT ACTAGAT GAAGACAATAAT AT CAAGCTGATT GACT T T GG 688 

I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I 
Db 121 CT TGAAGAT AGAGAATT T GCT ACT AGAT GAAGACAATAAT AT CAAGCT GATT GACTTT G G 180 



689 CTTGAGCAACTGTGCAGGGATCCTAGGTTACTCGGATCCATTCAGCACACAGTGTGGCAG 748 

I I I I ! I I I I I I I I i I I I I I I I I I I I I I I I I I I I I I i I I I I I I I I I I I I I I I II I I I I I I I 
181 CTTGAGCAACTGTGCAGGGATCCTAGGTTACTCGGATCCATTCAGCACACAGTGTGGCAG 240 

749 CCCTGCCTATGCTGCGCCAGAACTGCTTGCCAGGAAGAAATATGGCCCCAAAATTGATGT 808 

I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I 1 I I I I I I I I I I I I I I I I I I I I I I I 
241 CCCTGCCTATGCTGCGCCAGAACTGCTTGCCAGGAAGAAATATGGCCCCAAAATTGATGT 300 

809 CTGGTCAATAGGCGTGAACATGTATGCCATGCTGACGGGGACCCTACCTTTCACTGTGGA 868 

I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
301 CTGGTCAATAGGCGTGAACATGTATGCCATGCTGACGGGGACCCTACCTTTCACTGTGGA 360 

869 GCCTTTCAGCCTGAGGGCTCTGTATCAGAAGATGGTGGACAAAGCAATGAATCCCCTGCC 928 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I 
361 GCCTTTCAGCCTGAGGGCTCTGTATCAGAAGATGGTGGACAAAGCAATGAATCCCCTGCC 420 

929 GACCCAGCTCTCCACAGGGGCCGTCAACTTTCTGCGCTCCCTCCTGGAACCAGACCCTGT 988 

I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I II I I I II I I I I I I I I I I 
421 GACCCAGCTCTCCACAGGGGCCGTCAACTTTCTGCGCTCCCTCCTGGAACCAGACCCTGT 4 80 

989 GAAGAGGCCGAATATCCAGCAAGCGCTGGCGAATCGCTGGTTGAATGAGAATTACACTGG 1048 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I 
481 GAAGAGGC C GAAT AT C C AGCAAGC GCT GG C GAAT C GCT G GTT GAAT GAGAATT ACACT GG 540 

104 9 AAAGGTGCCCTGCAATGTCACCTATCCCAACAGGATTTCTTTGGAAGACCTGAGTCCCAG 1108 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I III 
541 AAAG GT GC C CT GCAAT GT CAC CT AT C C CAACAG GATT T CT T T GGAAGAC CT GAGT CN CAG 600 

1109 CGTGGTGCTGCACATGACTGAAAAGCTGGGCTATAAGAACAGTGACGTCATCAACACGGT 1168 
I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
601 CGTGGTGCTGCACATGACTGAAAAGCTGGGCTATAAGAACAGTGACGTCATCAACACGGT 660 

1169 GCTCTCCAACCGCGCCTGCCACATCCTGGCCATCTACTTCCTGTTGAACAAGAAACTTGA 1228 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I 

661 GCTCTCCAACCGCGCCTGCCACATCCTGGCCATCTACTTCCTGGTTGACAAGAAACTTGA 720 

1229 GC GCT ATT TGT C AGGGAAAT CAGAT AT C CAAGAT AGCAT CT G CT ACAAGACC C AGCT CT A 1288 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
721 GCGCTATTTGT CAGGGAAAT CAGAT ATCCAAGAT AGCAT CTGCTACAAGACCCAGCTCTA 780 



1289 
781 



C CAG AT AG AG A 1299 

I I I I I I I I I I I 

C CAGAT AGAGA 791 



GenCore version 5.1.6 
Copyright (c) 1993 - 2005 Compugen Ltd. 



OM nucleic - protein search, using f rame_plus_n2p model 



Run on: 



Title: 

Perfect score: 
Sequence : 

Scoring table: 



April 20, 2005, 09:58:04 



Search time 312 Seconds 

(without alignments) 

12455.668 Million cell updates/sec 



US-10-032-256A-1 
9026 

1 gcaggaggagccagggcagc aaaaaaaaaaaaaaaaaaaa 5024 



BLOSUM62 
Xgapop 10.0 
Ygapop 10.0 
Fgapop 6 . 0 
Delop 6.0 



Xgapext 
Ygapext 
Fgapext 
Delext 



0.5 
0.5 
7.0 
7.0 



Searched: 



2105692 seqs, 386760381 residues 



Total number of hits satisfying chosen parameters: 



4211384 



Minimum DB seq length: 
Maximum DB seq length: 



2000000000 



Post-processing: Minimum Match 0% 

Maximum Match 100% 
Listing first 45 summaries 

Command line parameters: 
-MODEL=frame+_n2p .model -DEV=xlh 

Q=/cgn2_l/USPTO_spool/US10032256/runat_19042005_181218_13540/app_query. fasta_l. 
191 

-DB=A_Geneseq__16Dec04 -QFMT=fastan -SUFFIX=n2p . rag -MINMATCH=0 . 1 -LOOPCL=0 
-LOOPEXT=0 -UNITS=bits -START=1 -END— 1 -MATRIX=blosum62 -TRANS=human40 . cdi 
-LIST=45 -DOCALIGN=200 -THR_SCORE=pct -THR_MAX=100 -THR_MIN=0 -ALIGN=15 
-MO DE= LOCAL -OUTFMT=pto -NORM=ext -HEAPSIZE=500 -MINLEN=0 -MAXLEN=2000000000 
-USER=US10032256_@CGN_l_l_366_@runat_19042005__181218_13540 -NCPU=6 -ICPU=3 
-NO_MMAP -LARGEQUERY -NEG_SCORES=0 -WAIT -DSPBLOCK=100 -LONGLOG 

-DEV_TIMEOUT=120 -WARN_TIMEOUT=30 -THREADS=1 -XGAPOP=10 -XGAPEXT=0 . 5 -FGAPOP=6 
-FGAPEXT =7 -YGAPOP=10 -YGAPEXT=0 . 5 -DELOP=6 -DELEXT=7 



Database 



A Geneseq 16Dec04:* 



geneseqpl980s : * 
geneseqpl990s : * 
geneseqp2000s : * 
geneseqp2001s : * 
geneseqp2002s : * 
geneseqp2003as : * 
geneseqp2003bs : * 
geneseqp2004s : * 



Pred. No. is the number of results predicted by chance to have a 



score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 
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ALIGNMENTS 



RESULT 1 
AA029754 

ID AA029754 standard; protein; 714 AA. 
XX 

AC AA029754; 
XX 

DT 27-AUG-2003 (first entry) 
XX 

DE Centrosome-associated kinase Mak V (K91R) mutant protein. 
XX 

KW Centrosome-associated kinase; cell cycle progression; therapy; enzyme; 

KW cytostatic; Mak V; gene; mutant; mutein. 

XX 

OS Unidentified. 
XX 

FH Key Location/Qualifiers 

FT Misc-dif f erence 91 

FT /note= "Wild-type Lys is substituted with Arg" 
XX 

PN WO2003038078-A2. 
XX 

PD 08-MAY-2003. 
XX 

PF 31-OCT-2002; 2002WO-GB004940 . 
XX 

PR 02-NOV-2001; 2001GB-00026415 . 
XX 

•PA (UYWA-) UNIV WARWICK. 
XX 

PI Stott D, Seung-Woon S, Craig GM; 
XX 

DR WPI; 2003-441358/41. 

DR N-PSDB; AAL60327. 
XX 

PT New centrosome-associated kinase with decreased or no kinase activity, 

PT useful for identifying inhibitors or activators of cell cycle 

PT progression, for use in manufacturing a treatment for uncontrolled cell 

PT cycle progression. 

XX 

PS Claim 2; Page 14-16; 23pp; English. 
XX 

CC The invention relates to centrosome-associated kinase with decreased or 

CC no kinase activity useful for identifying inhibitors or activators of 

CC cell cycle progression. The invention is useful for identifying an 

CC inhibitor or activator of cell cycle progression, which is useful in 

CC manufacturing a medicament for treating uncontrolled cell cycle 

CC progression. The present sequence is centrosome-associated kinase Mak V 

CC mutant protein 

XX 

SQ Sequence 714 AA; 
Alignment Scores: 



Pred. No. : 


3.29e-297 


Length: 


714 


Score : 


3722.00 


Matches : 


713 


Percent Similarity: 


100. 00% 


Conservative : 


1 


Best Local Similarity: 


99.86% 


Mismatches : 


0 


Query Match: 


41.24% 


Indels : 


0 



DB: 6 Gaps: 0 

US-10-032-256A-1 (1-5024) x AA029754 (1-714) 



Qy 


72 


ATGCCGGCAGCGGCGGGGGACGGGCTCTTGGGCGAGCCGGCGGCACCGGGGGGCGATGGA 


131 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


1 


Mpt~ProAl aAl aAl aGl vAsnGl vTieuLeniGl vGluP roAlaAl aP roGl vGl vA^nGl v 


20 


Qy 


132 


GGCGCGGAGGACACGACCAGGCCGGCGGCGGCCTGCGAGGGAAGTTTCCTGCCCGCCTGG 


191 






1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 




nh 


c. X 


Gl \/Al ^Gl nA^nTh r*TVi t~A rnPrnAl a Al a Al aPv<;Gl uGl v^prPhpT.puPrnAl aTrn 
uiyn_Laox Li_r\.o x 1 1 x x 11 y ri ^j^vj. aru. ctru.auy o ui uuiyjci r iiciicur x. vj./tx ci x x ky 


4 0 


Qy 


192 


GTGAGCGGCGTGTCCCGCGAGCGGCTCCGGGACTTCCAGCACCACAAGCGCGTGGGCAAC 


251 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




nh 


4 1 


1 ^prfZl \A/a 1 ^ptAyttGI iiArrfT.piiAmA^'nPV'ipGl nHi <?Hi T.\/<; A r rrVa 1 Gl uA <;n 




Qy 


252 


TACCTCATCGGCAGCAGGAAGCTGGGAGAGGGCTCCTTCGCCAAGGTGCGCGAGGGGCTG 


311 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 






D -L 


T\;rT.pnTl pGI w^p t~A rrfT.v«?T.PiiGl \/Gl nGl v^ptPVipAI aT.u<>Val ArnGl uGl vT.pii 
i. y i. ijc ui xc o ly jc x ai uliv o Jjcuvxyvjxuvjxy jcxt nc/\x g xi y o v a xrvx u<jx uux y iicu 


80 


Qy 


312 


CACGTGCTGACGGGAGAAAAGGTAGCTATCAAGGTCATCGATAAGAAAAGAGCCAAGAAA 


371 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 ::: 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 






ft 1 

O J. 


T-TH qVa 1 T.phTHkGI vGl iiTiV^Va 1 Al a T 1 ^AirrVa 1 T 1 ^A^nTiVsTiV 4 ? ArcrAl aTiVsTiVS 
11 x o vctxiicuiiHux yuXULiyD v a x/vx axx crvx y v a j — l i caj pxiyouy o ni y ru. auy ouyo 


100 


Qy 


372 


GACACCTACGTCACCAAAAACCTGCGTCGAGAGGGGCAGATCCAGCAGATGATCCGACAC 


431 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 l-l 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


i m 


A coTH rT\7 r~\fa 1 TH Y*T.\/c:Ac!'nT.piiArrrAKrfGl nGl vGl nTl pHl nGl nMph Tl pArrrHi 
rtop x 1 1 x. i y x vcixxiixxjy Dn.onxic ixnx ^al y ox uox y vjxiii xcoxiiuxiu ic lii cm ynxo 


120 


Qy 


432 


CCCAACATCACACAGCTCCTGGACATCTTGGAGACAGAGAACAGCTACTACCTGGTCATG 


491 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 




Db 


X X 


ProA<;nTl pTH r-Gl nT,piiT.piiA«;rjTl f»T.p*uGl nThrGl uA^n^e* rTvrTvrLpuVa 1 Me t~ 
t. x Urioiix x tr x ii X vjxiixicuxicurvo x xcucuux u x mux unoiis/c x x y x. lytucuvaiiici. 


140 


Qy 


492 


GAGCTGTGTCCTGGTGGCAACCTCATGCACAAGATCTACGAAAAGAAACGGTTGGATGAA 


551 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


1 d 1 

14 1 


Gl iiT.PiirwQPrn^l x/Gl \/A«inT.piiMp1~T-H <?T.\/<?Tl pTurPIl nT.\/c;Ti\/c; A rrrTiPii AsriGl u 
ox Li x_i t; Li y o r xuoxyox y.r\o nxic uric inxsiiysiici y xuxuuyouy j r\±. y uc u_t\o^jo j. ui 


160 


Qy 


552 


GCCGAGGCCCGCAGATACATCCGGCAACTCATCTCTGCGGTGGAACACCTGCACCGTGCG 


611 




1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




nh 


i 61 

lul 


Al aGI iiAl aArnArrfTurTl pArrrGl nT.pnTl p^prAl aVal Gl nHi sTiPllHi sArnAl a 
r\ x q ox u_tt.xci.rtx ijrtx y x y x x xcrtx y uxiixicuixc jc iAxa v »x\jx unxoucunx y a 


180 


Qy 


612 


GGGGTGGTTCACAGAGACTTGAAGATAGAGAATTTGCTACTAGATGAAGACAATAATATC 


671 




1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


i pi 

X 0 X 


C 1 1 1 W"i c A rrrS cnT pi i T \7cTl oGl nQcnT.PiiT.PiiT.Pii AcnHl nAdnAQnAQnTl p 

OX y V ct X Vdlfll OAl y/\o LJXltS UXjy o X X COX lXrt.o IIXjC U. Xj C Ui Xj C lXTVO LJ O X Uni3 IIAOHX x c 


C. \J VJ 


Qy 


672 


AAGCTGATTGACTTTGGCTTGAGCAACTGTGCAGGGATCCTAGGTTACTCGGATCCATTC 


731 




1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


201 


LysLeuIleAspPheGlyLeuSerAsnCysAlaGlylleLeuGlyTyrSerAspProPhe 


220 


Qy 


732 


AGCACACAGTGTGGCAGCCCTGCCTATGCTGCGCCAGAACTGCTTGCCAGGAAGAAATAT 


791 






1 1 I 1 1 1 1 ! 1 1 1 1 1 1 1 1 1 1 1 1 1 I 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I 1 1 1 1 1 1 1 1 1 1 1 




Db 


221 


SerThrGlnCysGlySerProAlaTyrAlaAlaProGluLeuLeuAlaArgLysLysTyr 


240 


Qy 


792 


GGC C C CAAAATT GAT GT CT GGT CAAT AG GC GT GAACAT GT AT GC CAT GCT GAC GGGGAC C 


851 




1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 II II 1 1 1 II 1 ! 1 1 1 1 1 I 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


241 


GlyProLysIleAspValTrpSerlleGlyValAsnMetTyrAlaMetLeuThrGlyThr 


260 


Qy 


852 


CTACCTTTCACTGTGGAGCCTTTCAGCCTGAGGGCTCTGTATCAGAAGATGGTGGACAAA 


911 



Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 



1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 i 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

261 LeuProPheThrValGluProPheSerLeuArgAlaLeuTyrGlnLysMetValAspLys 280 

912 GCAATGAATCCCCTGCCGACCCAGCTCTCCACAGGGGCCGTCAACTTTCTGCGCTCCCTC 971 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
281 AlaMetAsnProLeuProThrGlnLeuSerThrGlyAlaValAsnPheLeuArgSerLeu 300 

972 CTGGAACCAGACCCTGTGAAGAGGCCGAATATCCAGCAAGCGCTGGCGAATCGCTGGTTG 1031 

I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
301 LeuGluProAspProValLysArgProAsnlleGlnGlnAlaLeuAlaAsnArgTrpLeu 320 

1032 AAT GAGAATT ACAGT GGAAAGGT GC C CT G CAAT GT CAC CT AT C C CAACAGGATTT CT TT G 1091 

I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I 
321 AsnGluAsnTyrThrGlyLysValProCysAsnValThrTyrProAsnArglleSerLeu 340 

1092 GAAGACCT GAGT C CCAGC GT GGT GCT GCACATGACT GAAAAGCT GGGCTATAAGAACAGT 1151 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
341 GluAspLeuSerProSerValValLeuHisMetThrGluLysLeuGlyTyrLysAsnSer 360 

1152 GACGTCATCAACACGGTGCTCTCCAACCGCGCCTGCCACATCCTGGCCATCTACTTCCTG 1211 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
361 AspVallleAsnThrValLeuSerAsnArgAlaCysHisIleLeuAlalleTyrPheLeu 380 

1212 TT GAACAAGAAACTT GAGCGCT ATTT GT CAG G GAAAT CAGAT AT C CAAGAT AGCAT CT GC 1271 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
381 LeuAsnLysLysLeuGluArgTyrLeuSerGlyLysSerAspIleGlnAspSerlleCys 400 

1272 T ACAAGAC C C AGCT CTAC CAGAT AGAGAAGT GCAGAGC CAC CAAGGAGC C CT AT GAGGC C 1331 

I I I I I II I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I 
401 TyrLysThrGlnLeuTyrGlnlleGluLysCysArgAlaThrLysGluProTyrGluAla 420 

1332 T C C CT GGAT AC CT GGAC GAGGGACT T T GAATT C CAT G CT GT GCAGGATAAAAAGC C CAAA 1391 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
421 SerLeuAspThrTrpThrArgAspPheGluPheHisAlaValGlnAspLysLysProLys 440 

1392 GAACAAGAAAAAAGAGGT GATTTT CTCCACCGT CCGTTTTCCAAGAAGTT GGACAAGAAC 1451 
I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II II I I I I I I I I I I I I I I I I I I I 
441 GluGlnGluLysArgGlyAspPheLeuHisArgProPheSerLysLysLeuAspLysAsn 460 



1452 



461 



1512 



481 



1572 



501 



1632 



521 



CTGCCTTCTCACAAACAGCCATCGCCCTCGCTGATCACACAGCTCCAGAGTACCAAAGCC 1511 
I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

LeuProSerHisLysGlnProSerProSerLeuIleThrGlnLeuGlnSerThrLysAla 4 80 

CTGCTCAAAGACAGGAAGGCCTCCAAGTCAGGCTTCCCCGACAAAGATTCCTTCGTCTGC 1571 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

LeuLeuLysAspArgLysAlaSerLysSerGlyPheProAspLysAspSerPheValCys 500 

CGCAATCTTTTCCGAAAAACCTCTGATTCCAATTGTGTGGCTTCTTCTTCCATGGAATTC 1631 
I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I 

ArgAsnLeuPheArgLysThrSerAspSerAsnCysValAlaSerSerSerMetGluPhe 520 

AT C C CT GT CC CAC CT C C CAG GAC ACCAAGGATT GTAAAGAAACT AGAGC CACACCAACC A 1691 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I 

IleProValProProProArgThrProArglleValLysLysLeuGluProHisGlnPro 540 



1692 GGGCCGGGAAGTGCCAGCATCCTCCCCAAGGAAGAGCCCCTGCTGCTGGATATGGTACGC 1751 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I II I 



541 GlyProGlySerAlaSerlleLeuProLysGluGluProLeuLeuLeuAspMetValArg 560 

1752 T CCTTT GAGT CT GT GGAT C GAGAGGAC CACATAGAACT GCT GT C CC CTT CT CAC CAT TAT 1811 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
561 SerPheGluSerValAspArgGluAspHisIleGluLeuLeuSerProSerHisHisTyr 580 

1812 AGGATCCTGAGCTCGCCTGTGAGCCTGGCTCGTAGGAATTCTAGTGAGAGGACACTCTCC 1871 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
581 ArglleLeuSerSerProValSerLeuAlaArgArgAsnSerSerGluArgThrLeuSer 600 

1872 CAGGGGCTGCTGTCCGGAAGTACCTCACCTCTCCAAACTCCACTGCATTCCACGCTGGTC 1931 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
601 GlnGlyLeuLeuSerGlySerThrSerProLeuGlnThrProLeuHisSerThrLeuVal 620 

1932 TCTTTTGCCCACGAAGAAAAGAACAGCCCCCCGAAAGAGGAGGGTGTGTGTTCACCGCCT 1991 
I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
621 SerPheAlaHisGluGluLysAsnSerProProLysGluGluGlyValCysSerProPro 640 

1992 CCCGTTCCCAGTAATGGCCTCCTGCAGCCTCTGGGGAGCCCCAACTGTGTGAAGAGCAGG 2051 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
641 ProValProSerAsnGlyLeuLeuGlnProLeuGlySerProAsnCysValLysSerArg 660 

2052 GGACGGTTCCCCATGATGGGCATCGGACAGATGCTGAGGAAGCGGCACCAGAGCCTGCAG 2111 
I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
661 GlyArgPheProMetMetGlylleGlyGlnMetLeuArgLysArgHisGlnSerLeuGln 680 

2112 CCTTCCTCAGAGAGGTCCCTGGACGCCAGCATGTCCCCTCTGCAGCCCATAGCCCCCTCC 2171 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
681 ProSerSerGluArgSerLeuAspAlaSerMetSerProLeuGlnProIleAlaProSer 700 

2172 AGCCTCTCCTTTGACATGGCCGACGGTGTCAAGGGCCAGTGT 2213 

I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I 
701 SerLeuSerPheAspMetAlaAspGlyValLysGlyGlnCys 714 
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OM nucleic - protein search, using f rame_plus_n2p model 

Run on: April 20, 2005, 16:33:04 ; Search time 268.5 Seconds 

(without alignments) 
12438.215 Million cell updates/sec 

Title: US-10-032-256A-1 
Perfect score: 9026 

Sequence: 1 gcaggaggagccagggcagc aaaaaaaaaaaaaaaaaaaa 5024 

Scoring table: BLOSUM62 

Xgapop 10.0 , Xgapext 0.5 

Ygapop 10.0 , Ygapext 0.5 

Fgapop 6.0 , Fgapext 7.0 

Delop 6.0 , Delext 7.0 

Searched: 1421835 seqs, 332370683 residues 

Total number of hits satisfying chosen parameters: 2843670 

Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 

Post-processing: Minimum Match 0% 

Maximum Match 100% 
Listing first 45 summaries 

Command line parameters: 
-MODEL=frame+_n2p. model -DEV=xl h 

-Q=/cgn2^1/USPTO^spool/USl0032256/runat_19042005_181221_13651/app_query.fastaU.5191 

-DB=Published^Applications^AA -QFMT=fastan -SUFFlX=n2p. rapb -MINMATCH=0 . 1 
-LOOPCL=0 -LOOPEXT=0 -UNITS=bits -START=1 -END=-1 -MATRIX=bl OSUm62 
-TRANS=human40.cdi -LIST=45 -DOCALIGN=200 -THR_SCORE=pct -THR_MAX=100 
-THR_MIN=0 -ALIGN=15 -MODE=LOCAL -OUTFMT=ptO -NORM=eXt -HEAPSIZE=500 -MINLEN=0 
-MAXLEN=2000000000 -USER=USl0032256_@CGN^LL361_@runat^l9042005^181221_13651 
-NCPU=6 -ICPU=3 -NO_MMAP -LARGEQUERY -NEG_SCORES=0 -WAIT -DSPBLOCK=100 
-LONGLOG -DEV_TIMEOUT=120 -WARN_TIMEOUT=30 -THREADS=1 -XGAPOP=10 -XGAPEXT=0.5 
-FGAPOP=6 -FGAPEXT=7 -YGAPOP=:10 -YGAPEXT=0.5 -DELOP=6 -DELEXT=7 

Database : Publ i shed_Appl i cati ons_AA : * 

1: /cgn2_6/ptodata/l/pubpaa/US07_PUBCOMB . pep : * 
2 : /cgn2_6/ptodata/l/pubpaa/PCT_NEW_PUB . pep : * 
3 : /cgn2_6/ptodata/l/pubpaa/US06_NEW_PUB . pep : * 
4 : /cgn2_6/ptodata/l/pubpaa/US06_PUBCOMB . pep : * 
5 : /cgn2_6/ptodata/l/pubpaa/US07_NEW_PUB . pep : * 
6 : /cgn2_6/ptodata/l/pubpaa/PCTUS_PUBCOMB . pep : * 
7 : /cgn2_6/ptodata/l/pubpaa/US08_NEW_PUB . pep : * 
8 : /cgn2_6/ptodata/l/pubpaa/U508_PUBC0MB . pep : * 
9 : /cgn2_6/ptodata/l/pubpaa/US09A_PUBCOMB . pep : * 
10 : /cgn2_6/ptodata/l/pubpaa/us09B_PUBCOMB . pep : * 
11 : /cgn2_6/ptodata/l/pubpaa/us09C_PUBCOMB . pep : * 
12 : /cgn2_6/ptodata/l/pubpaa/US09_NEW_PUB . pep : * 
13 : /cgn2_6/ptodata/l/pubpaa/USl0A_PUBCOMB . pep : * 
14 : /cgn2_6/ptodata/l/pubpaa/USl0B_PUBCOMB . pep : * 
15 : /cgn2_6/ptodata/l/pubpaa/USl0C_PUBCOMB . pep : * 
16 : /cgn2_6/ptodata/l/pubpaa/USl0D_PUBCOMB . pep : * 
17 : /cgn2_6/ptodata/l/pubpaa/USl0_NEW_PUB . pep : * 
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18 : /cgn2_6/ptodata/l/pubpaa/USll_NEW_PUB . pep : * 
19 : /cgn2_6/ptodata/l/pubpaa/US60_NEW_PUB . pep : * 
20 : /cgn2_6/ptodata/l/pubpaa/US60_PUBCOMB . pep : * 

Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 
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Result 

NO. 


Score 
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Match 


Length DB 


ID 


Description 


1 


983 


10.9 


218 


9 


US-09-864-761-33631 


Sequence 33631, A 


2 


983 


10.9 


218 


9 


US-09-864-761-34109 


Sequence 34109, A 


3 


761 


8.4 


157 


9 


US-09-764-868-837 


Sequence 837, App 


4 


761 


8.4 


157 


11 


US-09-764-875-670 


Sequence 670, App 


5 


761 


8.4 


157 


11 


US-09-764-875-987 


Sequence 987, App 


6 


761 


8.4 


157 


15 


US-10-242-355-506 


Sequence 506, App 


7 


651 


7.2 


744 


9 


US-09-919-585-3 


Sequence 3, Appli 


8 


649.5 


7.2 


729 


14 


US-10-142-356-11 


Sequence 11, Appl 


9 


649.5 


7.2 


729 


14 


US-10-195-101-33 


Sequence 33, Appl 


10 


649.5 


7.2 


729 


14 


US-10-161-565-26 


Sequence 26, Appl 


11 


648 


7.2 


825 


15 


US-10-425-114-54516 


Sequence 54516, A 


12 


646. 5 


7.2 


795 


9 


US-09-919-585-12 


Sequence 12, Appl 


13 


646. 5 


7.2 


795 


14 


US-10-142-356-9 


Sequence 9, Appli 


14 


646. 5 


7.2 


795 


14 


US-10-161-565-25 


Sequence 25, Appl 


15 


646.5 


7.2 


795 


15 


US-10-311-034-4 


Sequence 4, Appli 


16 


646.5 


7.2 


795 


17 


US-10-753-267-120 


Sequence 120, App 


17 


644.5 


7.1 


793 


14 


US-10-195-101-32 


Sequence 32, Appl 


18 


642.5 


7.1 


769 


15 


US-10-363-616-403 


Sequence 403, App 


19 


640 


7.1 


752 


9 


US-09-835-081-2 


Sequence 2, Appli 


20 


640 


7.1 


752 


15 


US-10-258-106-16 


Sequence 16, Appl 


21 


640 


7.1 


752 


15 


US-10-276-645-7 


Sequence 7, Appli 


22 


639.5 


7.1 


779 


8 


US-08-817-832B-31 


Sequence 31, Appl 


23 


639.5 


7.1 


779 


15 


US-10-440-435-31 


Sequence 31, Appl 


24 


638.5 


7.1 


787 


16 


US-10-618-941-76 


Sequence 76, Appl 


25 


638 


7.1 


745 


14 


US-10-195-101-36 


Sequence 36, Appl 


26 


638 


7.1 


745 


14 


US-10-161-565-24 


Sequence 24, Appl 


27 


638 


7.1 


745 


15 


US-10-260-708-79 


Sequence 79, Appl 


28 


637.5 


7.1 


713 


14 


US-10-161-565-27 


Sequence 27, Appl 


29 


636 


7.0 


724 


15 


US-10-276-645-5 


Sequence 5, Appli 


30 


636 


7.0 


752 


16 


US-10-618-941-79 


Sequence 79, Appl 


31 


634.5 


7.0 


698 


15 


US-10-016-248-22 


Sequence 22, Appl 


32 


633 


7.0 


744 


9 


US-09-835-081-4 


Sequence 4, Appli 


33 


632.5 


7.0 


688 


15 


US-10-276-645-8 


Sequence 8, Appli 


34 


629.5 


7.0 


660 


15 


US-10-276-645-6 


Sequence 6, Appli 


35 


626.5 


6.9 


703 


15 


US-10-016-248-70 


Sequence 70, Appl 


36 


624.5 


6.9 


688 


14 


US-10-161-565-28 


Sequence 28, Appl 


37 


624.5 


6.9 


688 


14 


US-10-161-565-29 


Sequence 29, Appl 


38 


624 


6.9 


722 


8 


US-08-817-832B-32 


Sequence 32, Appl 


39 


624 


6.9 


722 


15 


US-10-440-435-32 


Sequence 32, Appl 


40 


624 


6.9 


776 


15 


US-10-366-288-6 


Sequence 6, Appli 


41 


623 


6.9 


667 


15 


US-10-016-248-74 


Sequence 74, Appl 


42 


623 


6.9 


722 


15 


US-10-274-194-4 


Sequence 4, Appli 


43 


623 


6.9 


722 


16 


US-10-760-407-4 


Sequence 4, Appli 


44 


622 


6.9 


724 


9 


US-09-919-585-9 


Sequence 9, Appli 


45 


621.5 


6.9 


691 


9 


US-09-919-585-6 


Sequence 6, Appli 



ALIGNMENTS 

RESULT 1 
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us-10-032-256a-l. n2p. rapb 

US-09-864-761-33631 

; Sequence 33631, Application US/09864761 

; Patent No. US20020048763A1 

; GENERAL INFORMATION: 

; APPLICANT: Penn, sharron G. 

; APPLICANT: Rank, David R. 

; APPLICANT: Hanzel, David K. 

; APPLICANT: Chen, Wensheng 

; TITLE OF INVENTION: HUMAN GENOME-DERIVED SINGLE EXON NUCLEIC ACID PROBES USEFUL 
FOR 

; TITLE OF INVENTION: GENE EXPRESSION ANALYSIS BY MICROARRAY 

; FILE REFERENCE: Aeomica-X-1 

; CURRENT APPLICATION NUMBER: US/09/864 , 761 

; CURRENT FILING DATE: 2001-05-23 

; PRIOR APPLICATION NUMBER: US 60/180,312 

; PRIOR FILING DATE: 2000-02-04 

; PRIOR APPLICATION NUMBER: US 60/207,456 

; PRIOR FILING DATE: 2000-05-26 

; PRIOR APPLICATION NUMBER: US 09/632,366 

; PRIOR FILING DATE: 2000-08-03 

; PRIOR APPLICATION NUMBER: GB 24263.6 

; PRIOR FILING DATE: 2000-10-04 

; PRIOR APPLICATION NUMBER: US 60/236,359 

; PRIOR FILING DATE: 2000-09-27 

; PRIOR APPLICATION NUMBER: PCT/US01/00666 

; PRIOR FILING DATE: 2001-01-30 

; PRIOR APPLICATION NUMBER: PCT/US01/00667 

; PRIOR FILING DATE: 2001-01-30 

; PRIOR APPLICATION NUMBER: PCT/US01/00664 

; PRIOR FILING DATE: 2001-01-30 

; PRIOR APPLICATION NUMBER: PCT/US01/00669 

; PRIOR FILING DATE: 2001-01-30 

; PRIOR APPLICATION NUMBER: PCT/US01/00665 

; PRIOR FILING DATE: 2001-01-30 

; PRIOR APPLICATION NUMBER: PCT/US01/00668 

; PRIOR FILING DATE: 2001-01-30 

; PRIOR APPLICATION NUMBER: PCT/US01/00663 

; PRIOR FILING DATE: 2001-01-30 

; PRIOR APPLICATION NUMBER: PCT/US01/00662 

; PRIOR FILING DATE: 2001-01-30 

; PRIOR APPLICATION NUMBER: PCT/US01/00661 

; PRIOR FILING DATE: 2001-01-30 

; PRIOR APPLICATION NUMBER: PCT/US01/00670 

; PRIOR FILING DATE: 2001-01-30 

; PRIOR APPLICATION NUMBER: US 60/234,687 

; PRIOR FILING DATE: 2000-09-21 

; PRIOR APPLICATION NUMBER: US 09/608,408 

; PRIOR FILING DATE: 2000-06-30 

; PRIOR APPLICATION NUMBER: US 09/774,203 

; PRIOR FILING DATE: 2001-01-29 

; NUMBER OF SEQ ID NOS: 49117 

; SOFTWARE: Annomax Sequence Listing Engine vers. 1.1 

; SEQ ID NO 33631 
; LENGTH: 218 
; TYPE: PRT 

; ORGANISM: Homo sapiens 
; FEATURE: 

; OTHER INFORMATION: MAP TO AP000260.1 

; OTHER INFORMATION: EXPRESSED IN BT474, SIGNAL =3.2 

I OTHER INFORMATION: EXPRESSED IN HELA, SIGNAL =3.8 

; OTHER INFORMATION: EXPRESSED IN ADULT LIVER, SIGNAL =3.3 

; OTHER INFORMATION: EXPRESSED IN BRAIN, SIGNAL = 5 

; OTHER INFORMATION: EXPRESSED IN PLACENTA, SIGNAL =4.1 
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OTHER 
OTHER 
OTHER 
OTHER 
OTHER 
OTHER 
OTHER 
OTHER 
US-09-864- 



INFORMATION 
INFORMATION 
INFORMATION 
INFORMATION 
INFORMATION 
INFORMATION 
INFORMATION 
INFORMATION 
761-33631 



us-10-032-256a-l.n2p. rapb 
EXPRESSED IN BONE MARROW, SIGNAL =4.9 
EXPRESSED IN HBL100, SIGNAL =3.8 
EXPRESSED IN FETAL LIVER, SIGNAL =3.8 
EXPRESSED IN LUNG, SIGNAL =5.4 
EXPRESSED IN HEART, SIGNAL =4.2 
EST_HUMAN HIT: AA368352.1, EVALUE 3.00e-25 
SWISSPROT HIT: P08461, EVALUE l.OOe+00 
EST_HUMAN HIT: BF529471.1, EVALUE 2.00e-87 



Alignment Scores: 
Pred. No.: 
Score: 

Percent Similarity: 
Best Local Similarity: 
Query Match: 
DB: 

US-10-032-256A-1 (1-5024) x US-09-864-761-33631 (1-218) 

TCCTTCGTCTGCCGCAATCTTTTCCGAAAAACCTCTGATTCCAATTGTGTGGCTTCTTCT 1619 
MINI I I I I I I I I I I I I I I II I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I 
SerPheGlyCysArgAsnllePheArgLysThrSerAspSerAsnCysValAl aserser 20 



1.51e-56 


Length: 


218 


983.00 


Matches : 


184 


92 . 20% 


Conservative: 


17 


84.40% 


Mismatches: 


17 


10 .89% 


Indels: 


0 


9 


Gaps: 


0 



Qy 


1560 


Db 


1 


Qy 


1620 


Db 


21 


Qy 


1680 


Db 


41 


Qy 


1740 


Db 


61 


Qy 


1800 


Db 


81 


Qy 


1860 


Db 


101 


Qy 


1920 


Db 


121 


Qy 


1980 


Db 


141 


Qy 


2040 


Db 


161 


Qy 


2100 


Db 


181 


Qy 


2160 


Db 


201 



TCCATGGAATTCATCCCTGTCCCACCTCCCAGGACACCAAGGATTGTAAAGAAACTAGAG 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I III 



CCACACCAACCAGGGCCGGGAAGTGCCAGCATCCTCCCCAAGGAAGAGCCCCTGCTGCTG 
I I I I I I I I I I I I I I I I I I I I I I I I III I I I I I I : :: I I I I I I ::: I I I 



GATATGGTACGCTCCTTTGAGTCTGTGGATCGAGAGGACCACATAGAACTGCTGTCCCCT 

1 I 1 I I I t t I I I I I I I 1 I I 1 : : : | | | | 1 | : r : 1 | | r : : | | | | | | 1 | 1 

AspMetvalArgSerPheGluSerValAspArgAspAspHisValGluValLeuSerPro 

TCTCACCATTATAGGATCCTGAGCTCGCCTGTGAGCCTGGCTCGTAGGAATTCTAGTGAG 

Mil 1 1 J I Ml I MM II 1 1 : : : I II 1 1 M I 1 1 1 1 1 1 II J M I M II M II I II 1 1 1 J 



1679 
40 

1739 

60 

1799 



1859 



81 serHisHisTyrArglleLeuAsnSerProValSerLeuAlaArgArgAsnSerSerGlu 100 



AGGACACTCTCCCAGGGGCTGCTGTCCGGAAGTACCTCACCTCTCCAAACTCCACTGCAT 

MIIIMIMM MINI I I J I I I I I I lllllllll I I I I I I I I I I I I 
ArgThrLeuSerProGlyLeuProSerGlySerMetSerProLeuHisThrProLeuHis 

TCCACGCTGGTCTCTTTTGCCCACGAAGAAAAGAACAGCCCCCCGAAAGAGGAGGGTGTG 

IN II IIIIIMIIMIII MM I:::llll IIMMII Mil MM MM III::: 
ProThrLeuVal serPheAl aHi sGl uAspLysAsnSerProProLysGl uGl uGl yLeu 

TGTTCACCGCCTCCCGTTCCCAGTAATGGCCTCCTGCAGCCTCTGGGGAGCCCCAACTGT 

III Mill I Ml Ml IIMMII MM ::: II 1 1 II II I 1 1 1 1 1 1 1 1 II II II I 
cysCysProProProValProSerAsnGlyProMetGlnProLeuGlySerProAsncys 

GTGAAGAGCAGGGGACGGTTCCCCATGATGGGCATCGGACAGATGCTGAGGAAGCGGCAC 
I II I I I I I I II I II I I I I I I II I I I II II II I I II I I I I I I II II II I II II I I I I I I I I 

" " " il 



1919 

120 

1979 

140 

2039 

160 

2099 



161 Val Lysse rArgGl yArgPheProMetMetGl yll eGlyGl nMetLeuArgLysArgHi s 180 



CAGAGCCTGCAGCCTTCCTCAGAGAGGTCCCTGGACGCCAGCATGTCCCCTCTGCAGCCC 

1 1 1 1 1 I 1 1 1 1 I I I I I I I I = = = = = = 1 1 1 MI:::IMIII::: MIIIMIMM 

Gl nSerLeuGl nProserAl aAspArgProLeuGl uAl aSerLeuProProLeuGl nPro 

ATAGCCCCCTCCAGCCTCTCCTTTGACATGGCCGACGGTGTCAAGGGCCAGTGT 2213 

= = = 111111 :::|||:::|IIIIIIIMIIIIIIIIIIIIII INI 
LeuAlaProValAsnLeuAlaPheAspMetAlaAspGlyValLysThrGlnCys 218 



2159 
200 
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GenCore version 5.1.6 
Copyright (c) 1993 - 2005 Compugen Ltd. 



OM nucleic - protein search, using f rame_plus_n2p model 



Run on: 



April 20, 2005, 15:00:38 ; Search time 77.5 Seconds 

(without alignments) 
12474.659 Million cell updates/sec 



Title: 

Perfect score: 
Sequence : 



US-10-032-256A-1 
9026 

1 gcaggaggagccagggcagc 



aaaaaaaaaaaaaaaaaaaa 5024 



Scoring table: 



BLOSUM62 



Xgapop 10.0 , Xgapext 

Ygapop 10.0 , Ygapext 

Fgapop 6.0 , Fgapext 

Delop 6.0 , Delext 



0.5 
0.5 
7.0 
7.0 



Searched: 



283416 seqs, 96216763 residues 



Total number of hits satisfying chosen parameters: 



566832 



Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 

Post-processing: Minimum Match 0% 

Maximum Match 100% 
Listing first 45 summaries 

Command line parameters: 
-MODEL=f rame+_n2p . model -DEV=xlh 

Q=/cgn2_l/USPTO_spool/US10032256/runat_19042005_181219_13561/app_query.fasta_l. 
191 

-DB=PIRJ79 -QFMT=fastan -SUFFIX=n2p . rpr -MINMATCH=0 . 1 -LOOPCL=0 -LOOPEXT=0 
-UNITS=bits -START=1 -END=-1 -MATRIX=blosum62 -TRANS=human40 . cdi -LIST=45 
-DOCALIGN=200 -THR_SCORE=pct -THR_MAX=100 -THR_MIN=0 -ALIGN=15 -MODE=LOCAL 
-OUTFMT=pto -NORM=ext -HEAPSIZE=500 -MINLEN=0 -MAXLEN=2 000000000 
-USER=US10032256_@CGN_l_l_124_@runat_19042005_181219_13561 -NCPU=6 -ICPU=3 
-NO_MMAP -LARGEQUERY . -NEG__SCORES=0 -WAIT -DSPBLOCK=100 -LONGLOG 

-DEV_TIMEOUT=120 -WARN_TIMEOUT=30 -THREADS=1 -XGAPOP=10 -XGAPEXT=0 . 5 -FGAPOP=6 
-FGAPEXT =7 -YGAPOP=10 -YGAPEXT=0 . 5 -DELOP=6 -DELEXT=7 

Database : PIR 79:* 



1 
2 
3 
4 



pirl : * 
pir2: * 
pir3: * 
pir4 : * 



Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 
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ALIGNMENTS 

RESULT 1 
G01025 

serine/threonine protein kinase - human 
C; Species: Homo sapiens (man) 



C;Date: 21-Dec-1996 #sequence_revision 06-Jun-1997 #text_change 16-Aug-2004 
C;Accession: G01025 
R;Navarro, E. 

submitted to the EMBL Data Library, April 1996 
A; Reference number: H00564 
A;Accession: G01025 

A; Status: preliminary; translated from GB/EMBL/DDBJ 
A;Molecule type: mRNA 
A; Residues: 1-745 <NAV> 

A; Cross-references: UNIPROT : Q15524 ; UNIPROT: Q96HB3 ; EMBL:X97630; NID:gl310674 

C; Superfamily : protein kinase homology 

F; 18-271/Domain: protein kinase homology <KIN> 

Alignment Scores: 

Pred. No.: 1.57e-19 Length: 745 

Score: 640.00 Matches: 195 

Percent Similarity: 46.21% Conservative: 122 

Best Local Similarity: 28.43% Mismatches: 225 

Query Match: 7.09% Indels: 144 

DB: 2 Gaps: 28 

US-10-032-256A-1 (1-5024) x G01025 (1-745) 

GACTTCCAGCACCACAAGCGCGTGGGCAACTACCTCATCGGCAGCAGGAAGCTGGGAGAG 281 
III III III ::: I I I I I I I I I : : : : : : : : : I I I : : : 

AspGluGlnProHis IleGlyAsnTyrArgLeu LeuLysThrlleGlyLys 28 

GGCTCCTTCGCCAAGGTGCGCGAGGGGCTGCACGTGCTGACGGGAGAAAAGGTAGCTATC 341 
I I I : : : I I I I I I I I I I I I : : : I I I :: : I I I I I I I I I :::::: I I I I I I : : : 

GlyAsnPheAlaLysValLysLeuAlaArgHisIleLeuThrGlyLysGluValAlaVal 4 8 

AAG GT CAT C GAT AAGAAAAGAGC CAAGAAAGACAC CT AC GT CAC CAAAAAC CT GC GT C GA 4 01 

I I I ::: I I I I I I I I I : : : : : : : : : III III 

LysIlelleAspLysThrGlnLeuAsnSerSerSer LeuGlnLysLeuPheArg 66 

GAGGGGCAGAT CCAGCAGATGAT CCGACACCCCAACATCACACAGCT CCT GGACATCTT G 4 61 
III : : : I I I ::::::::: I I I I I I I I I I I I : : : I I I ::::::::: 
GluValArglleMetLysValLeuAsnHisProAsnlleValLysLeuPheGluVallle 86 

GAGACAGAGAACAGCTACT AC CT GGT CAT GGAGCT GT GT C CT G GT GGCAACCT CAT GCAC 521 

I I I I I I I I I : : : I I I I I I I I I I I I I I I I I I I I I : : : 

GluThrGluLysThrLeuTyrLeuValMetGluTyrAlaSerGlyGlyGluValPheAsp 106 

AAGAT CT AC GAAAAGAAAC GGTT GGATGAAGC C GAG GC C C GCAGATAC AT C CGGCAACT C 581 

: : : III::: III I I I I I I I I I I I I I I I : : : 

TyrLeuValAlaHisGlyArgMetLysGluLysGluAlaArgAlaLysPheArgGlnlle 126 

ATCTCTGCGGTGGAACACCTGCACCGTGCGGGGGTGGTTCACAGAGACTTGAAGATAGAG 641 

: : : I I I I I I I I I : : : : : : III::: ::: I I I I I I I I I I I I I I I I I I III 

ValSerAlaValGlnTyrCysHisGlnLysPhelleValHisArgAspLeuLysAlaGlu 14 6 

AATT T GCTACT AGAT GAAGACAATAAT AT CAAGCT GATT GACT TT GG CTT GAGCAACT GT 701 
I I I I I I I I I I I I I II III MINIMI::: I II II I II I I I I I I I 
AsnLeuLeuLeuAspAlaAspMetAsnlleLysIleAlaAspPheGlyPheSerAsn 165 



Qy 


222 


Db 


12 


Qy 


282 


Db 


29 


Qy 


342 


Db 


49 


Qy 


402 


Db 


67 


Qy 


462 


Db 


87 


Qy 


522 


Db 


107 


Qy 


582 


Db 


127 


Qy 


642 


Db 


147 



Qy 



702 



GCAGGGATCCTAGGTTACTCGGATCCATTCAGCACACAGTGTGGCAGCCCTGCCTATGCT 761 
: : : ::: III II I I I I I I I I I I I I I I I I 



Db 


166 


-GluPheThrPheGlyAsnLysLeuAspThrPheCysGlySerProProTyrAla 


183 


Qy 

Db 


762 
184 


GCGCCAGAACTGCTTGCCAGGAAGAAATAT GGCC C CAAAATT GAT GT CT G GT CAAT A 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 :::::: 1 1 1 1 1 1 1 1 1 1 1 1 : : : 
AlaProGluLeuPheGlnGlyLysLysTyrAspGlyProGluValAspValTrpSerLeu 


818 
203 


Qy 

Db 


819 
204 


GGCGTGAACATGTATGCCATGCTGACGGGGACCCTACCTTTCACTGTGGAGCCTTTCAGC 
1 1 1 1 1 1 : : : 1 1 1 ::::::::: | | | ::: | | | | | | | | | : : : : : : 
GlyVallleLeuTyrThrLeuValSerGlySerLeuProPheAspGlyGln Asn 


878 
221 


Qy 


879 


CT GAGGGCT CT GT AT CAGAAGAT GGT GGACAAAGCAAT GAAT CCC CT GC CGAC C CAGCT C 
III::: III :::::::::::: : : : : : : I I I : : : 
LeuLysGluLeuArgGluArgValLeu ArgGlyLysTyrArglleProPheTyrMet 


938 


Db 


222 


240 


Qy 


939 


TCCACAGGGGCCGTCAACTTTCTGCGCTCCCTCCTGGAACCAGACCCTGTGAAGAGGCCG 

1 1 1 1 1 1 III III::: 1 II : : : 1 1 1 1 1 1 1 1 1 
SerThrAspCysGluAsnLeuLeuLysLysPheLeuIleLeuAsnProSerLysArgGly 


998 


Db 


241 


260 


Qy 


999 


AAT AT C C AGCAAGC GCT GGC GAAT C G CT GGT T GAAT GAGAATT ACACT GGAAAGGT GC CC 

:::::: 1 1 1 : : : : : : I I I I I I : : : 1 1 1 : : : 
ThrLeuGluGlnlleMetLysAspArgTrpMetAsnValGlyHisGluAsp 


1058 


Db 


261 


277 


Qy 


1059 


TGCAATGTCACCTATCCCAACAGGATTTCTTTGGAAGACCTGAGTCCCAGCGTGGTG 

:::::: 1 1 1 III III 
AspGluLeuLys ProTyrValGluPro 


1115 


Db 


278 


286 


Qy 


1116 


CTGCACATGACTGAAAAGCTGGGCTATAAGAAC 

: : : : : : 1 1 1 1 1 1 
LeuProAspTyrLysAspProArgArgThrGluLeuMetValSerMetGlyTyrThrArg 


1148 


Db 


287 


306 


Qy 


1149 


AGTGACGTCATCAACACGGTGCTCTCCAACCGCGCCTGCCACATCCTGGCCATCTACTTC 

:::::: :::::::::::: | | | :::::: | | | | | | 
GluGluIleGlnAspSerLeuValGlyGlnArgTyrAsnGluValMetAlaThrTyrLeu 


1208 


Db 


307 


326 


Qy . 


1209 


CT GT T GAACAAGAAACTT GAGC GCT ATTTGT CAG GGAAAT CAGATAT C CAA GATAGC 

1 1 1 II 1 III |||::::::::: III::: 
LeuLeuGlyTyrLys SerSerGluLeuGluGlyAspThr 


1265 


Db 


327 


339 


Qy 


1266 


ATCT GCTACAAGACCCAGCT CTACCAGATAGAGAAGTGCAGAGCCACCAAGGAGCCCTAT 


1325 


Db 


340 


III 1 II : : : 

IleThrLeuLysProArg Pro 


346 


Qy 


1326 


GAGGCCTCCCTGGATACCTGGACGAGGGACTTTGAATTCCATGCTGTGCAG 

III III : : : II 1 III MINI 
SerAlaAspLeuThrAsnSerSerAlaGlnPheProSerHisLysValGlnArgSerVal 


1376 


Db 


347 


366 


Qy 


1377 


GATAAAAAGCCCAAAGAACAA GAAAAAAGAGGTGATTTTCTC CAC 


1421 


Db 


367 


1 I I I I I : : : : : : :::::: III : : : 
SerAlaAsnProLysGlnArgArgPheSerAspGlnAlaGlyProAlalleProThrSer 


386 


Ov 


1422 


CGTCCGTTTTCCAAGAAGTTGGACAAGAACCTGCCTTCTCACAAACAGCCATCGCCCTCG 

:::|||IHIII III :::|||:::||| 
AsnSerTyrSerLysLysThrGlnSerAsnAsnAlaGluAsnLysArgProGlu 


1481 


Db 


387 


404 


Qy 


1482 


CTGATCACACAGCTCCAGAGTACCAAAGCCCTGCTCAAAGACAGGAAGGCCTCCAAGTCA 

: : : | | | I I I : : : : : : : : : 


1541 


Db 


405 


412 



Qy 1542 GGCTTCCCCGACAAAGATTCCTTCGTCTGCCGCAATCTTTTCCGAAAAACCTCTGATTCC 1601 

Db 412 412 

Qy 1602 AATTGTGTGGCTTCTTCTTCCATGGAATTCATCCCTGTCCCACCTCCCAGGACACCAAGG 1661 

MINIMI::: ::: I I I I I I I I I 

Db 413 AlaSerSerThrAlaLys ValProAlaSerProLeuProGly 426 

Qy 1662 ATTGTAAAGAAACTAGAGCCACACCAACCAGGGCCGGGAAGTGCCAGCATCCTC 1715 

: : : : : : I I I Ml Ml : : : M I : : : I M 

Db 427 LeuGluArgLys LysThrThrProThrProSerThrAsnSerValLeuSerThr 444 

Qy 1716 CCCAAGGAAGAGCCCCTGCTGCTGGATATGGTACGCTCCTTTGAGTCT 1763 

::: M M M M I Ml 

Db 445 SerThrAsnArgSerArgAsnSerProLeuLeuGlxiArgAlaSerLeuGlyGlnAlaSer 464 

Qy 1764 GTGGATCGA GAGGAC C ACATAGAACT GCT GT C C C CTT CT CAC CATT AT AGGAT C CT G 1820 

• •• •••iii ••• • • • 

••■ •••iii •■• ••• 

Db 465 IleGlnAsnGlyLysAspSerLeuThrMetProGlySerArgAlaSerThrAlaSerAla 484 

Qy 1821 AGCTCGCCTGTGAGCCTGGCTCGTAGGAATTCTAGTGAGAGGACACTCTCCCAGGGGCTG 1880 

Ml::: I I I I II I II I II ::::::::: 
Db 485 SerAlaGlyValSerAlaAlaArgProArgGlnHisGlnLysSer 499 

Qy 1881 CTGTCCGGAAGTACCTCACCTCTCCAAACTCCACTGCATTCCACGCTGGTCTCTTTTGCC 1940 

: : : I I II I I I M III 
Db 500 MetSerGlySerValHisPro 506 

Qy 1941 CACGAAGAAAAGAACAGCCCCCCGAAAGAGGAGGGTGTGTGTTCACCGCCTCCCGTTCCC 2000 

:::::: II I I II III III . Ill 

Db 507 AsnLysAlaSerGlyLeuProProThrGluSerAsn CysGluValProArgProSer 525 

Qy 2001 AGTAATGGCCTCCTGCAGCCTCTGGGGAGCCCCAACTGT GT GAAGAGCAGGGGA 2054 

: : : III::: MINI::: : : : III III 

Db 526 ThrAlaProGlnArgValProValAlaSerProSerAlaHisAsnlleSerSerSerGly 545 

Qy 2055 C GGT T C CC CATGAT GG GCAT CGGACAGAT G CT GAGGAAGC GGCAC CAGAGCCT G CAGC CT 2114 

M I : : : I II I II I M M I : : : : : : I 

Db 546 -GlyAlaPro-Asp ArgThrAsnPheProArgGlyValSerSerArgSerThrP 563 

Qy 2115 TC CT CAGAGAGGT C CCT GGACGCCAGCAT GT C CCCT CT GCAGC C CAT A 2162 

II I I M M : : : M I ::: | | | :::::: M I : 
Db 563 HeHisAlaGlyGlnLeuArgGlnValArgAspGlnGlnAsnLeuProTyrGlyVal T 582 

Qy 2163 GCCCCCTCCAGCCT 217 6 

: : II I III 
Db 582 hrProAlaSerPro 586 
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714 
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HUNK_MOUSE 

Q68UT7 

HUNK_HUMAN 

Q6VZ18 

Q6VZ17 



088866 mus musculu 
Q68ut7 pan troglod 
P57058 homo sapien 
Q6vzl8 xenopus lae 
Q6vzl7 xenopus lae 
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6 


1284 


14.2 


279 


2 


Q68UT6 


Q68ut6 


pan troglod 


7 


691 


7.7 


792 


2 


Q6INT7 


Q6i nt7 


xenopus lae 


8 


648 


7.2 


771 


2 


Q69ZI7 


Q69zi7 


mus musculu 


9 


646.5 


7.2 


795 


2 


Q9P0L2 


Q9p012 


homo sapien 


10 


646 


7.2 


752 


2 


Q8CIP4 


Q8ci p4 


mus musculu 


11 


644.5 


7.1 


793 


2 


008678 


008678 


rattus norv 


12 


640 


7.1 


752 


1 


MRK4_HUMAN 


Q96134 


homo sapien 
homo sapien 


13 


638 


7.1 


745 


2 


Q15524 


Q15524 


14 


. 636 


7.0 


797 


2 


Q8VHF0 


Q8vhf0 


rattus norv 


15 


634 


7.0 


. 729 


2 


Q9JKE4 


Q9ike4 


mus musculu 


16 


633 


7.0 


744 


2 


Q9JKE5 


Q9]ke5 


mus musculu 


17 


632 


7.0 


752 


2 


Q8NG37 


Q8ng37 


homo sapien 


18 


632 


7.0 


795 


2 


Q8VHJ5 


Q8vh j 5 


mus musculu 


19 


631 


7.0 


888 


2 


Q8BR95 


Q8br95 


mus musculu 


20 


629 


7.0 


778 


2 


Q96HB3 


Q96hb3 


homo sapien 


21 


628.5 


7.0 


755 


2 


Q7KZI7 


Q7kzi7 


homo sapien 


22 


628 


7.0 


722 


2 


Q6PDR4 


Q6pdr4 


mus musculu 


23 


628 


7.0 


780 


2 


Q804T1 


Q804tl 


xenopus lae 


24 


627 


6.9 


776 


1 


MRK3_HUMAN 


P27448 


homo sapien 


25 


625.5 


6.9 


942 


2 


Q8MVW9 


Q8mvw9 


haemonchus 


26 


625.5 


6.9 


1066 


2 


Q8MVX0 


Q8mvx0 


haemonchus 


27 


623 


6.9 


722 


2 


008679 


008679 


rattus norv 


28 


621.5 


6.9 


691 


2 


Q96RG0 


Q96rg0 


homo sapien 


29 


620 


6.9 


776 


2 


Q7ZYL7 


Q7zyl7 


xenopus lae 


30 


620 


6.9 


785 


2 


Q8QGV3 


Q8qgv3 


xenopus lae 


31 


619.5 


6.9 


1371 


2 


Q9Y2K2 


Q9y2k2 


homo sapien 


32 


618.5 


6.9 


719 


2 


Q68A18 


Q68al8 


homo sapien 


33 


618.5 


6.9 


993 


2 


Q7KRK4 


Q7krk4 


drosophila 


34 


617.5 


6.8 


1098 


2 


Q7KRK7 


Q7krk7 


drosophila 


35 


614.5 


6.8 


722 


2 


Q802W0 


Q802w0 


brachydanio 


36 


608 


6.7 


966 


2 


Q8MVX2 


Q8mvx2 


haemonchus 


37 


607 


6.7 


725 


2 


Q804T2 


Q804t2 


xenopus lae 


38 


606.5 


6.7 


834 


2 


Q8MVX1 


Q8mvxl 


haemonchus 


39 


606 


6.7 


1096 


2 


Q17368 


Q17368 


caenorhabdi 


40 


602 


6.7 


1189 


2 


Q6PHV1 


Q6phvl 


brachydanio 


41 


601.5 


6.7 


1192 


2 


Q17346 


Q17346 


caenorhabdi 


42 


601.5 


6.7 


1192 


2 


Q9TW45 


Q9tw45 


caenorhabdi 


43 


600 


6.6 


774 


1 


MRK2_MOUSE 


Q05512 


mus musculu 


44 


598.5 


6.6 


462 


2 


075271 


075271 


homo sapien 


45 


598 


6.6 


1187 


2 


Q6NSM8 


Q6nsm8 


brachydanio 



ALIGNMENTS 

RESULT 1 
HUNK_MOUSE 

ID HUNKJVIOUSE STANDARD; PRT; 714 AA. 

AC 088866; 

DT 16-OCT-2001 (Rel. 40, Created) 

DT 16-OCT-2001 (Rel. 40, Last sequence update) 

DT 05-JUL-2004 (Rel. 44, Last annotation update) 

DE Hormonally up-regulated neu tumor-associated kinase (EC 2.7.1.37) 
DE (serine/tnreonine-protein kinase MAK-V) . 
GN Name=Hunk; Synonyms=Makv; 
OS Mus musculus (Mouse). 

oc Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 
oc Mammalia; Eutheria; Rodentia; Sciurognathi ; Muridae; Murinae; Mus. 
OX NCBl_TaxlD=10090 ; 
RN [1] 

RP SEQUENCE FROM N.A. 

RC strain=A/SN; TlSSUE=Mammary gland; 

RX MEDLINE=97396592; PubMed=9273061; 

RA Korobko I. v., Kabishev A. A. , Kiselev S.L.; 
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RT "Identification of the new protein kinase specifically transcribed in 
RT mouse tumors with high metastatic potential."; 
RL Dokl. Akad. Nauk SSSR 354:554-556(1997). 
RN [2] 

RP SEQUENCE FROM N.A. 

RC STRAIN=FVB; TISSUE=Mammary gland; 

RX MEDLINE=20130113; PubMed=10662544; DOI=10. 1006/geno . 1999.6078; 

RA Gardner H.P., Wertheim G.B.W., Ha S.I., copeland N.G. , Gilbert 0.3. , 

RA Denkins N.A., Marquis S.T., chodosh L.A.; 

RT "Cloning and characterization of Hunk, a novel mammalian SNFl-related 

RT protein kinase."; 

RL Genomics 63:46-59(2000). 

CC -!- CATALYTIC ACTIVITY: ATP + a protein = ADP + a phosphoprotein. 

CC -!- SIMILARITY: Belongs to the Ser/Thr protein kinase family. SNFl 
CC subfamily. 

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinformatics and the EMBL outstation - 

CC the European Bioinformatics institute. There are no restrictions on its 

CC use by non-profit institutions as long as its content is in no way 

CC modified and this statement is not removed. Usage by and for commercial 

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ 

CC or send an email to license@isb-sib. ch) . 

CC 

DR EMBL ; AF055919; AAC61489.1; 

DR EMBL; AF167987; AAF35282.1; -. 

DR HSSP; Q63450; 1A06. 

DR MGD; MGI:1347352; Hunk. 

DR interPro; IPR011009; Kinase_like. 

DR InterPro; IPR000719; Proteinase. 

DR InterPro; IPR008271; Ser_thr_pkin_AS. 

DR InterPro; IPR002290; Ser_thr_pkinase. 

DR Pfam; PF00069; Pkinase; 1. 

DR ProDom; PD000001; Proteinase; 1. 

DR SMART; SM00220; S_TKc; 1. 

DR PROSITE; PS00107; PROTEIN_KINASE^ATP; 1. 

DR PROSITE; PS50011; PROTEIN_KINASE_DOM; 1. 

DR PROSITE; PS00108; protein_kinase_st; 1. 

KW ATP-binding; serine/threonine-protein kinase; Transferase. 

FT DOMAIN 62 320 Protein kinase. 

FT NP_BIND 68 76 ATP (By similarity). 

FT BINDING 91 91 ATP (By similarity). 

FT ACT_SITE 186 186 Proton acceptor (By similarity). 

FT CONFLICT 697 697 T -> I (in Ref. 2). 

SQ SEQUENCE 714 AA; 79602 MW; D35A53E7A8D9BD1F CRC64; 

Alignment scores: 
Pred. No.: 
Score: 

Percent similarity: 
Best Local Similarity: 
Query Match: 
DB: 

US-10-032-256A-1 (1-5024) x HUNK_MOUSE (1-714) 

QY 72 ATGCCGGCAGCGGCGGGGGACGGGCTCTTGGGCGAGCCGGCGGCACCGGGGGGCGATGGA 131 

I I I I I I I I I I I II I I I I IN I I I I I I I MM Ml M I M I I i I I I I I I I M I I I I I 1 I 
Db 1 MetProAlaAlaAlaGlyAspGlyLeuLeuGlyGluProAlaAlaProGlyGlyAspGly 20 



2.42e-206 


Length: 


714 


3720.00 


Matches: 


713 


99.86% 


Conservative: 


0 


99.86% 


Mismatches: 


1 


41.21% 


indels: 


0 


1 


Gaps: 


0 



Qy 132 GGCGCGGAGGACACGACCAGGCCGGCGGCGGCCTGCGAGGGAAGTTTCCTGCCCGCCTGG 191 

MIIIIIIIIIIIIIIMIIIIIIMIIIMIIIIIIIIIIIIIIIIIIIIIIIIM 
Db 21 GlyAlaGluAspThrThrArgProAlaAlaAlaCysGluGlySerPheLeuProAlaTrp 40 
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Qy 


192 


Db 


41 


Qy 


252 


Db 


61 


Qy 


312 


Db 


81 


Qy 


372 


Db 


101 


Qy 


432 


Db 


121 


Qy 


492 


Db 


141 


Qy 


552 


Db 


161 


Qy 


612 


Db 


181 


Qy 


672 


Db 


201 


Qy 


732 


Db 


221 


Qy 


792 


Db 


241 


Qy 


852 


Db 


261 


Qy 


912 


Db 


281 


Qy 


972 


Db 


301 


Qy 


1032 


Db 


321 


Qy 


1092 
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GTGAGCGGCGTGTCCCGCGAGCGGCTCCGGGACTTCCAGCACCACAAGCGCGTGGGCAAC 251 

I 1 1 I II I I M I I 1 1 1 1 1 1 1 1 1 1 I 1 1 1 1 I 1 1 1 1 1 1 I I! I I II I 1 1 1 1 1 1 1 I II 1 1 Ml I I I 
Val Se rGl yVal SerArgGl uArgLeuArgAspPheGl nm sm sLysArgVal Gl yAsn 60 

TACCTCATCGGCAGCAGGAAGCTGGGAGAGGGCTCCTTCGCCAAGGTGCGCGAGGGGCTG 3 11 

I I I I I M I I I I I IN I I Mill II I III II I II I II II I I I I I I I Mill II II I II I II 
TyrLeulleGlyserArgLysLeuGlyGluGlySerPheAlaLysValArgGluGlyLeu 80 

CACGTGCTGACGGGAGAAAAGGTAGCTATCAAGGTCATCGATAAGAAAAGAGCCAAGAAA 371 

I 1 I I I I I I I I I [ I I I 1 i I I I I I I I 1 I I k I I 1 I I 1 I I I I I 1 1 I I I i I I I I I I I I I I I I I I I 

Hi sval LeuThrGlyGl uLysVal Al all eLysVal II eAspLysLysArgAl aLysLys 100 

GACACCTACGTCACCAAAAACCTGCGTCGAGAGGGGCAGATCGAGCAGATGATCCGACAC 431 

MIIIIIMIIIIIIIIMIMIIIIMMMIIIMIIIIIIIIIIMIIIIIIIIIM 
AspThrTyrValThrLysAsnLeuArgArgGl uGlyGl nil eGl nGl nMetll eArgm s 120 

CCCAACATCACACAGCTCCTGGACATCTTGGAGACAGAGAACAGCTACTACCTGGTCATG 491 

I I I I I I I 1 1 I I I I I I I I I 1 I 1 I I I I I I 1 t 1 I I I 1 I I I 1 I I I I I I I I I 1 1 I I 1 I I I I I I I 1 
ProAsnlleThrGlnLeuLeuAsplleLeuGluThrGluAsnSerTyrTyrLeuValMet 140 

GAGCTGTGTCCTGGTGGCAACCTCATGCACAAGATCTACGAAAAGAAACGGTTGGATGAA 551 

I I 1 1 1 I I i M I M M I I I i 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 M 1 ! 
GluLeucysProGlyGlyAsnLeuMetmsLyslleTyrGluLysLysArgLeuAspGlu 160 

GCCGAGGCCCGCAGATACATCCGGCAACTCATCTCTGCGGTGGAACACCTGCACCGTGCG 6 11 

I I I II II I M I I 1 1 I M II I 1 1 I M II II I M I II M I I M I II M I II I 1 1 I M I II M 

Al aGl uAl aArgArgTy rll eArgGl nLeull eSerAl aval Gl um sLeum sArgAl a 180 

GGGGTGGTTCACAGAGACTTGAAGATAGAGAATTTGCTACTAGATGAAGACAATAATATC 671 

I I M I I I I I I I I II III I II I I II II II I I I I I II I I I I I I I I I I Ml M I I M I I II I I 

Gl yVal Val Hi sArgAspLeuLysll eGl uAsnLeuLeuLeuAspGl uAspAsnAsnll e 200 

AAGCTGATTGACTTTGGCTTGAGCAACTGTGCAGGGATCCTAGGTTACTCGGATCCATTC 731 

I II M II 1 1 1 II II I M II I I II II II II II II I II II II II I II II I I II II II I 1 1 I I 
LysLeulleAspPheGlyLeuSerAsnCysAlaGlylleLeuGlyTyrSerAspProPhe 220 

AGCACACAGTGTGGCAGCCCTGCCTATGCTGCGCCAGAACTGCTTGCCAGGAAGAAATAT 791 

MMMMMMMMMMIMMMMMMMMMMMMMMMMMMMI 
SerThrGl nCysGl yserProAl aTyrAl aAl aProGl uLeuLeuAl aArgLysLysTyr 240 

GGCCCCAAAATTGATGTCTGGTCAATAGGCGTGAACATGTATGCCATGCTGACGGGGACC 851 

I I I I I I I II I II II 1 1 I I M 1 1 I I I I I I M I I M 1 1 1 1 I M I 1 1 II 1 1 1 1 1 1 II I I II 1 1 
GlyProLyslleAspValTrpSerlleGlyValAsnMetTyrAlaMetLeuThrGlyThr 260 

CTACCTTTCACTGTGGAGCCTTTCAGCCTGAGGGCTCTGTATCAGAAGATGGTGGACAAA 9 11 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M i 1 1 1 1 1 1 1 M II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 _ 
LeuProPheThrValGluProPheSerLeuArgAlaLeuTyrGlnLysMetValAspLys 280 

GCAATGAATCCCCTGCCGACCCAGCTCTCCACAGGGGCCGTCAACTTTCTGCGCTCCCTC 971 

I M I II II I I I I 1 1 1 1 I II I II I II I II II II M I I I I II 1 1 II I II I 1 1 1 1 I II I I I II 
AlaMetAsnProLeuProThrGlnLeuSerThrGlyAlaValAsnPheLeuArgserLeu 300 

CTGGAACCAGACCCTGTGAAGAGGCCGAATATCCAGCAAGCGCTGGCGAATCGCTGGTTG 10 3 1 

M I II I 1 1 I M M I I I I II M I M I I II I I I I I II I I I 1 1 1 1 II I II I II 1 1 I M I II 1 1 
LeuGl uProAspProVal LysArgProAsnll eGl nGl nAl aLeuAl aAsnArgTrpLeu 320 

AATG AG AATTACACTGG AAAGGTGCCCTGCAATGTC ACCTATCCCAACAGGATTTCTTTG 10 9 1 
I I I II I I I I I II I II I I I I I II II I I I II I II I I II I I II I I II M II II I I I II I II I I 
AsnGluAsnTyrThrGlyLysValProCysAsnValThrTyrProAsnArglleSerLeu 340 

GAAGACCTGAGTCCCAGCGTGGTGCTGCACATGACTGAAAAGCTGGGCTATAAGAACAGT 115 1 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
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441 
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1452 


Db 


461 
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481 


Qy 


1572 
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501 


Qy 


1632 
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521 


Qy 


1692 
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541 


Qy 


1752 


Db 


561 


Qy 


1812 


Db 


581 


Qy 


1872 


Db 


601 


Qy 


1932 


Db 


621 


Qy 


1992 


Db 


641 


Qy 


2052 
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360 



GACGTCATCAACACGGTGCTCTCCAACCGCGCCTGCCACATCCTGGCCATCTACTTCCTG 12 11 

I I 1 1 I I M I I I 1 1 I 1 1 II I 1 1 1 1 1 I I I 1 1 I 1 1 1 1 1 1 1 1 I 1 1 I 1 1 I I I I 1 1 I I 1 1 I I I I I I 

AspVal II eAsnThrVal LeuSerAsnArgAl aCysHi sll eLeuAl all eTyrPheLeu 380 

TTGAACAAGAAACTTGAGCGCTATTTGTCAGGGAAATCAGATATCCAAGATAGCATCTGC 12 71 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 II 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 II 1 1 1 

LeuAsnLysLysLeuGluArgTyrLeuSerGlyLysSerAsplleGlnAspSerlleCys 400 

TACAAGACCCAGCTCTACCAGATAGAGAAGTGCAGAGCCACCAAGGAGCCCTATGAGGCC 1331 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

TyrLysThrGI nLeuTy rGl nil eGl uLysCysArgAl aThrLysGl uProTyrGl uAl a 420 

TCCCTGGATACCTGGACGAGGGACTTTGAATTCCATGCTGTGCAGGATAAAAAGCCCAAA 1391 

I I I I I I II I I I I I I II I I I I I I II I I I II MINI I I I I I I I I I II I II I I I I I I II Ml 

421 SerLeuAspThrTrpThrArgAspPheGl uPheHi sAl aval Gl nAspLysLysProLys 440 

GAACAAGAAAAAAGAGGTGATTTTCTCCACCGTCCGTTTTCCAAGAAGTTGGACAAGAAC 145 1 

I I I I I I II I I I II I II I I I I I I II I I I II I I III I I I I I I I II I II I II I II Ml Mill 
GluGlnGluLysArgGlyAspPheLeuHisArgProPheSerLysLysLeuAspLysAsn 460 

CTGCCTTCTCACAAACAGCCATCGCCCTCGCTGATCACACAGCTCCAGAGTACCAAAGCC 1511 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 M I M 1 1 

LeuProSerHi sLysGl nProSerProSerLeull eThrGl nLeuGl nSerThrLysAl a 480 

CTGCTCAAAGACAGGAAGGCCTCCAAGTCAGGCTTCCCCGACAAAGATTCCTTCGTCTGC 1571 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 M I M I M I M 1 1 1 1 M I M 1 1 

LeuLeuLysAspArgLysAlaSerLysSerGlyPheProAspLysAspSerPheValCys 500 

CGCAATCTTTTCCGAAAAACCTCTGATTCCAATTGTGTGGCTTCTTCTTCCATGGAATTC 1631 

II II II II II I II I II I I II II II II II II II II I I I I II I II II I II I I I I II I II II I 
ArgAsnLeuPheArgLysThrSerAspSerAsnCysVal Al aSerSerSerMetGl uPhe 520 

ATCCCTGTCCCACCTCCCAGGACACCAAGGATTGTAAAGAAACTAGAGCCACACCAACCA 1691 

1 1 II M II III I II II I III II II MM Mill I Ml I MM 1 1 II 1 1 1 II I II 1 1 1 II I 

II eProVal ProProProArgThrProArgll eval LysLysLeuGl uProHi sGl nPro 540 

GGGCCGGGAAGTGCCAGCATCCTCCCCAAGGAAGAGCCCCTGCTGCTGGATATGGTACGC 1751 

1 1 II 1 1 1 1 III I M 1 1 1 1 1 1 II I III 1 1 Mill I 1 1 1 1 1 1 1 1 I I 1 1 II I I II 1 1 II 1 1 1 1 

GlyProGl ySerAl aSerll eLeuProLysGl uGl uProLeuLeuLeuAspMetVal Arg 560 

TCCTTTGAGTCTGTGGATCGAGAGGACCACATAGAACTGCTGTCCCCTTCTCACCATTAT 1811 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
SerPheGl uSerVal AspArgGl uAspHi sll eGl uLeuLeuSerProSerHi sHi sTyr 580 

AGG ATCCTG AGCTCGCCTGTGAGCCTGGCTCGTAGGAATTCTAGTGAGAGGACACTCTCC 1871 

I I M I II I 1 1 1 1 II 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 MM 1 1 1 1 1 1 I II I II I Ml 1 1 1 1 I Mill 
Argil eLeuSerSerProValserLeuAlaArgArgAsnSerSerGluArgThrLeuSer 600 

CAGGGGCTGCTGTCCGGAAGTACCTCACCTCTCCAAACTCCACTGCATTCCACGCTGGTC 1931 
I I II I II II II II II I I I II II II II I III MM II II II I I I III I III I Ml I II III 

GlnGlyLeuLeuSerGlySerThrSerProLeuGlnThrProLeuHisSerThrLeuVal 620 

TCTTTTGCCCACGAAGAAAAGAACAGCCCCCCGAAAGAGGAGGGTGTGTGTTCACCGCCT 1991 

I Ml I II II MM I II I I 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 II II 1 1 1 II II I II I II I M I Mill 

SerPheAlaHisGluGl uLysAsnSerProProLysGluGluGlyValCysSerProPro 640 

CCCGTTCCCAGTAATGGCCTCCTGCAGCCTCTGGGGAGCCCCAACTGTGTGAAGAGCAGG 2051 

I I I I I I Ml I I I I I II I I M I I I I I I I I I I II II I I I I I I I I I Ml I I I I I I I I I II Ml 

ProVal ProSerAsnGlyLeuLeuGlnProLeuGlySerProAsnCysValLysSerArg 660 
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III MINN MINIMI 
661 GlyArgPheProMetMetG 



2112 



0-032-256a-l. n2p. rup 

I I I I 11 Ill i I I I ! I I I I I 1 I I 1 I I I I 

yll eGl yd nMetLeuArgLysArgHi sGl nSerLeuGl n 680 



2171 



CCTTCCTCAGAGAGGTCCCTGGACGCCAGCATGTCCCCTCTGCAGCCCATAGCCCCCTCC 
I I I I I I I I 1 I I 1 I I I I I I I I I I I I I i I I I II Ml I II 

681 ProSerSerGluArgSerLeuAspAlaSerMetSerProLeuGlnProThrAlaProSer 700 



2172 



2213 



AGCCTCTCCTTTGACATGGCCGACGGTGTCAAGGGCCAGTGT 
I I I I I I I I I I I I I I I I I I I I 1 I I 1 I I I I I I I I } I I I 1 I I I I 1 

701 SerLeuSerPheAspMetAlaAspGlyValLysGlyGlnCys 714 
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GenCore version 5.1.6 
Copyright (c) 1993 - 2005 Compugen Ltd. 



OM nucleic - nucleic search, using sw model 

Run on: April 19, 2005, 23:19:57 ; Search time 745 Seconds 

(without alignments) 
11034.435 Million cell updates/sec 



Title: 

Perfect score: 
Sequence: 



US-10-032-256A-1 
5024 

1 gcaggaggagccagggcagc aaaaaaaaaaaaaaaaaaaa 5024 



Scoring table: IDENTITY^ NUC 

Gapop 10.0 , Gapext 1.0 

Searched: 1202784 seqs, 818138359 residues 

Total number of hits satisfying chosen parameters: 

Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 



2405568 



Post-processing: 



Database 



Minimum Match 0% 
Maximum Match 100% 
Listing first 45 summaries 

lssued_Patents_NA: * 



/cgn2_6/ptodata/l/i na/5A_C0MB . seq : * 
/cgn2_6/ptodata/l/i na/5B_COMB . seq : * 
/cgn2_6/ptodata/l/i na/6A_COMB . seq : * 
/cgn2_6/ptodata/l/i na/6B_COMB . seq : * 
/cgn2_6/ptodata/l/i na/PCTUS_COMB . seq : * 
/cgn2_6/ptodata/l/i na/backf i 1 esl. seq : * 



Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 



% 



SUMMARIES 



It 

0. 


Score 


Query 
Match 


Length 


DB 


ID 






1 


157.8 


3.1 


279 


4 


US 


-09- 


-621-976-14904 


2 


136.6 


2.7 


2175 


4 


US 


-09- 


-984-890-1 


3 


136.6 


2.7 


2175 


4 


US 


-10- 


-274-194-1 


4 


136.6 


2.7 


2224 


4 


US 


-09- 


-949-016-2384 


5 


136.6 


2.7 


2224 


4 


US- 


-09- 


-949-016-2385 


6 


136.6 


. 2.7 


2950 


4 


US 


-09- 


-949-016-1546 


7 


136.6 


2.7 


2950 


4 


US 


-09- 


-949-016-1547 


8 


122.8 


2.4 


1647 


3 


US 


-09- 


-101-146-44 


9 


121.4 


2.4 


2499 


4 


US- 


-09- 


-949-016-5704 


10 


119.8 


2.4 


2104 


3 


US 


-09- 


-313-930-1 


11 


119.8 


2.4 


2104 


4 


US- 


-09- 


-023-655-1191 


12 


114.2 


2.3 


3609 


4 


US 


-09- 


-799-875-6 


13 


114.2 


2.3 


5983 


4 


US 


-09- 


-799-875-4 


14 


110.6 


2.2 


2274 


4 


US 


-09- 


•772-647-3 


15 


108.8 


2.2 


2899 


4 


us 


-09- 


-774-528-112 


16 


105.4 


2.1 


1539 


4 


us 


-09- 


-633-328B-1 


17 


101.8 


2.0 


1442 


4 


us 


-09- 


-949-016-1590 



Descri ption 



Sequence 
Sequence 
Sequence 
Sequence 
Sequence 
Sequence 
Sequence 
Sequence 
Sequence 
Sequence 
Sequence 
Sequence 
Sequence 
Sequence 
Sequence 
Sequence 
Sequence 



14904, A 
1, Appli 
1, Appli 

2384, Ap 

2385, Ap 

1546, Ap 

1547, Ap 
44, Appl 
5704, Ap 
1, Appli 
1191, Ap 
6, Appli 
4, Appli 
3, Appli 
112, App 
1, Appli 
1590, Ap 
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18 


101.8 


2.0 


1442 


4 


US-09-949-016-1591 


Sequence 


19 


101.6 


2.0 


1863 


4 


US-09-949-016-4303 


Sequence 


20 


100.4 


2.0 


1599 


3 


US-09-256-465-1 


Sequence 


21 


100.4 


2.0 


1599 


3 


US-09-167-322-3 


Sequence 


22 


100.4 


2.0 


1599 


4 


US-09-023-655-1004 


Sequence 


23 


100.4 


2.0 


2908 


3 


US-09-930-181-1 


Sequence 


24 


99.4 


2.0 


2980 


4 


US-10-003-690-1 


Sequence 


25 


98.2 


2.0 


1480 


4 


US-09-016-434-1454 


Sequence 


26 


98.2 


2.0 


2181 


4 


US-09-417-197-70 


Sequence 


27 


98.2 


2.0 


2184 


4 


US-09-417-197-138 


Sequence 


28 


98.2 


2.0 


2610 


2 


US-09-212-771-1 


Sequence 


29 


98.2 


2.0 


2610 


3 


US-09-091-058-1 


Sequence 


30 


98.2 


2.0 


2610 


4 


US-09-023-655-1206 


Sequence 


31 


98.2 


2.0 


2610 


4 


US-09-590-740-1 


Sequence 


32 


97.8 


1.9 


2169 


4 


US-09-016-434-1147 


Sequence 


33 


96.6 


1.9 


1104 


4 


US-09-691-861A-1 


Sequence 


34 


96.6 


1.9 


4162 


2 


US-08-459-448A-26 


Sequence 


35 


96.6 


1.9 


4162 


3 


US-08-459-595A-26 


Sequence 


36 


96.6 


1.9 


4162 


3 


US-08-459-504B-26 


Sequence 


37 


96.6 


1.9 


4162 


3 


US-08-459-444-26 


Sequence 


38 


96.6 


1.9 


4162 


3 


US-09-547-422-26 


Sequence 


39 


96.6 


1.9 


4162 


4 


US-09-988-462-26 


Sequence 


40 


96.6 


1.9 


4165 


1 


US-07-951-715A-26 


Sequence 


41 


96.6 


1.9 


9862 


4 


US-09-691-861A-3 


Sequence 


42 


96.4 


1.9 


727 


4 


US-09-270-767-28248 


Sequence 


43 


96.4 


1.9 


1371 


4 


US-09-270-767-12475 


Sequence 


44 


96.4 


1.9 


2902 


4 


US-09-579-664B-4 


Sequence 


45 


96.4 


1.9 


2902 


4 


US-10-355-975A-4 


Sequence 



1591, Ap 
4303, Ap 
1, Appli 
3, Appli 
1004, Ap 
1, Appli 
1, Appli 
1454, Ap 
70, Appl 
138, App 
1, Appli 
1, Appli 
1206, Ap 
1, Appli 
1147, Ap 
1, Appli 
26, Appl 
26, Appl 
26, Appl 
26, Appl 
26, Appl 
26, Appl 
26, Appl 

3, Appli 
28248, A 
12475, A 

4, Appli 
4, Appli 



ALIGNMENTS 



RESULT 1 

US-09-621-976-14904 

; sequence 14904, Application US/09621976 

; Patent No. 6639063 

; GENERAL INFORMATION: 

; APPLICANT: Dumas Milne Edwards, D.B. 

; APPLICANT: Dobert, S. 

; APPLICANT: Giordano, J.Y. 

; TITLE OF INVENTION: ESTs and Encoded Human Proteins. 

; FILE REFERENCE: GENSET . 054PR2 

; CURRENT APPLICATION NUMBER: US/09/621 , 976 

; CURRENT FILING DATE: 2000-07-21 

; NUMBER OF SEQ ID NOS: 19335 

; SOFTWARE: Patent. pm 

; SEQ ID NO 14904 

; LENGTH: 279 

; TYPE: DNA 

; ORGANISM: Homo sapiens 
US-09-621-976-14904 

Query Match 3.1%] Score 157.8; DB 4; Length 279; 

Best Local Similarity 79.3%; Pred. No. 5.5e-32; 

Matches 226; conservative 0; Mismatches 52; Indels 7; Gaps 3; 
Qy 4731 TTATATGGAAAATGTGGCTATCCTTTTGTTAAGTGCAGAGTGTATTGTCTGTTTGACCCA 4790 

IIIIIIIIIM I II III 1 1 1 1 Mill I II I MM MM II II 

Db 1 TTATATGGAAAGCTCGACTCTCCCTTTGGTAAGTCCGAAGCATGTTGTTTGTTCGA-CCG 59 

Qy 4791 TGACTGTCCTTCATGAATGAGTCTTTGCCTGTGATTCTAGTCAGCCTGTGGCTACTGATG 4850 

llllllll I I I lllll MIMMIMM lllll lllll I llllll 
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60 TGACTGTCTTCC TCAGTCTGTGCCTGTGATTCCAGTCACCCTGTAGTTACTGACA 114 

4851 GGAA-CGGCCGATCTGTCATCATGTGAAGTCCAGGAGGAAGAATCTATTTTAGTCATACG 4909 

III III I II MM IIIIIMII llllllll II 1 1 1 1 1 1 III I 

115 GAAATTGACTGGACTGTCATTGTGTGAAGTCTAGGAGGAAATGTCCATTTTAATTGTATG 174 
4910 ATTTGGTCATGAGTAAGGACTATATTTATGTCACCACTATTGAATATATGTACTTTTATA 4969 

II III II II I III II II III 1 1 MM Mill II II MM II I Mill llllll II 

175 ATTTGGTCATAAGTAAGGACTATATTTATGTCACCATTATTAGATATATGTACTTTTGTA 234 

4970 ATGGCTGTGAAATACACTTTTTCCTCACAAAAAAAAAAAAAAAAA 5014 

III I I I M I I I II I I llllll MMIIMIMIMM 

235 ATGACTGTGAAATACACTTTTCCCTCACTAAAAAAAAAAAAAAAA 279 
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GenCore version 5.1.6 
Copyright (c) 1993 - 2005 Compugen Ltd. 



OM nucleic - nucleic search, using sw model 

Run on: April 19, 2005, 23:22:33 ; Search time 17388 Seconds 

(without alignments) 
11795.202 Million cell updates/sec 

Title: US-10-032-256A-1 
Perfect score: 5024 

Sequence: 1 gcaggaggagccagggcagc aaaaaaaaaaaaaaaaaaaa 5024 

Scoring table: IDENTITY_NUC 

Gapop 10.0 , Gapext 1.0 

Searched: 45554873 seqs, 20411521753 residues 

Total number of hits satisfying chosen parameters: 91109746 

Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 

Post-processing: Minimum Match 0% 

Maximum Match 100% 
Listing first 45 summaries 

Database : Pending_Patents_NA^Main:* 

1: /cgn2_6/ptodata/l/pna/PCTUSl_COMB . seq : * 

2 : /cgn2_6/ptodata/l/pna/PCTUS2_COMB . seq : * 

3 : /cgn2_6/ptodata/l/pna/PCTUS_COMB . seq : * . 

4 : /cgn2__6/ptodata/l/pna/US06_COMB . seq : * 

5 : /cgn2_6/ptodata/l/pna/US07_COMB . seq : * 

6 : /cgn2_6/ptodata/l/pna/US080_COMB . seq : * 

7 : /cgn2_6/ptodata/l/pna/US081_COMB . seq : * 

8 : /cgn2_6/ptodata/l/pna/US082_COMB . seq : * 

9 : /cgn2__6/ptodata/l/pna/US083_COMB . seq : * 

10 : /cgn2_6/ptodata/l/pna/US084_COMB . seq : * 

11 : /cgn2_6/ptodata/l/pna/US085_COMB . seq : * 

12 : /cgn2_6/ptodata/l/pna/US086_COMB . seq : * 

13 : /cgn2_6/ptodata/l/pna/US087_COMB . seq : * 

14 : /cgn2_6/ptodata/l/pna/US088«COMB . seq : * 

15 : /cgn2_6/ptodata/l/pna/US089_COMB . seq : * 

16 : /cgn2_6/ptodata/l/pna/US090_COMB . seq : * 

17 : /cgn2_6/ptodata/l/pna/US091_COMB . seq : * 

18 : /cgn2_6/ptodata/l/pna/US092/UCOMB . seq : * 

19 : /cgn2_6/ptodata/l/pna/US092B_COMB . seq : * 

20 : /cgn2_6/ptodata/l/pna/US093A^COMB . seq : * 

21 : /cgn2_6/ptodata/l/pna/US093B_C0MB . seq : * 

22 : /cgn2_6/ptodata/l/pna/US094_COMB . seq : * 

2 3 : /cgn2_6/ptodata/l/pna/us095A^COMB . seq : * 

24 : /cgn2_6/ptodata/l/pna/US095B_COMB . seq : * 

2 5 : /cgn2_6/ptodata/l/pna/US095C_COMB . seq : * 

26 : /cgn2_6/ptodata/l/pna/US095D_COMB . seq : * 

27 : /cgn2_6/ptodata/l/pna/us096A_COMB . seq : * 

28 : /cgn2_6/ptodata/l/pna/US096B_COMB . seq : * 

29 : /cgn2_6/ptodata/l/pna/US096C_COMB . seq : * 

30 : /cgn2_6/ptodata/l/pna/US096D_COMB . seq : * 

31 : /cgn2_6/ptodata/l/pna/US096E_COMB . seq : * 

32 : /cgn2_6/ptodata/l/pna/US097A^C0MB . seq : * 

33 : /cgn2_6/ptodata/l/pna/US097B_COMB . seq : * 
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34: . /cgn2_6/ptodata/l/pna/US097C_COMB.seq: 

35 : /cgnZ^G/ptodata/l/pna/USOgSA^COMB . seq : 

36 : /cgn2_6/ptodata/l/pna/US098B_COMB . seq : 

37 : /cgn2_6/ptodata/l/pna/US098C_COMB . seq : 

38 : /cgn2_6/ptodata/l/pna/US098D_COMB . seq : 

39 : /cgn2_6/ptodata/l/pna/US099A_COMB . seq : 

40 : /cgn2_6/ptodata/l/pna/US099B_COMB . seq : 

41 : /cgn2_6/ptodata/l/pna/US099C_COMB . seq : 

42 : /cgn2_6/ptodata/l/pna/US099D_COMB . seq : 

43 : /cgn2_6/ptodata/l/pna/US099E_COMB . seq : 

44 : /cgn2_6/ptodata/l/pna/US099F_COMB . seq : 

45 : /cgn2_6/ptodata/l/pna/US099G_COMB . seq : 

46 : /cgn2_6/ptodata/l/pna/USl00A^COMB . seq : 

47 : /cgn2_6/ptodata/l/pna/USlOOB_COMB . seq : 

48 : /cgn2_6/ptodata/l/pna/USl01/^COMB . seq : 

49 : /cgn2_6/ptodata/l/pna/USl0lB_COMB . seq : 

50 : /cgn2_6/ptodata/l/pna/USl02A_COMB . seq : 

51: /cgn2_6/ptodata/l/pna/USl02B_COMB . seq : 

52 : /cgn2_6/ptodata/l/pna/USl03/^COMB .seq : 

53 : /cgn2_6/ptodata/l/pna/USl03B_COMB .seq : 

54 : /cgn2_6/ptodata/l/pna/USl04/^COMB . seq : 

55: /cgn2_6/ptodata/l/pna/USl04B_COMB.seq: 

56 : /cgn2_6/ptodata/l/pna/usl05^COMB .seq : 

57 : /cgn2_6/ptodata/l/pna/usl05 B_COMB .seq : 

58 : /cgn2_6/ptodata/l/pna/USl06A_COMB .seq : 

59 : /cgn2_6/ptodata/l/pna/USl07A^COMB .seq : 

60 : /cgn2_6/ptodata/l/pna/USl07B_COMB . seq : 

61 : /cgn2_6/ptodata/l/pna/USl07C_COMB . seq : 

62 : /cgn2_6/ptodata/l/pna/USl07D_COMB . seq : 

63 : /cgn2_6/ptodata/l/pna/USl08A_COMB . seq : 

64 : /cgn2_6/ptodata/l/pna/USl08B_COMB . seq : 

65 : /cgn2_6/ptodata/l/pna/USl09A^COMB . seq : 

66 : /cgn2_6/ptodata/l/pna/USl09B_COMB . seq : 

67 : /cgn2_6/ptodata/l/pna/USl09C_COMB . seq : 

68 : /cgn2_6/ptodata/l/pna/USH0_COMB . seq : * 

69 : /cgn2_6/ptodata/l/pna/US6000_COMB . seq : * 

70 : /cgn2_6/ptodata/l/pna/us6001_COMB . seq : * 

71: /cgn2_6/ptodata/l/pna/us6002_COMB . seq : * 

72 : /cgn2_6/ptodata/l/pna/US6003_COMB . seq : * 

73 : /cgn2_6/ptodata/l/pna/US6004_COMB . seq : * 

74 : /cgn2_6/ptodata/l/pna/US6005_COMB . seq : * 

75 : /cgn2_6/ptodata/l/pna/US6006_COMB . seq : * 

76 : /cgn2_6/ptodata/l/pna/US6007_COMB . seq : * 

77 : /cgn2_6/ptodata/l/pna/US6008_COMB .seq : * 

78 : /cgn2_6/ptodata/l/pna/US6009_COMB .seq : * 

79 : /cgn2_6/ptodata/l/pna/US6010_COMB .seq : * 

80 : /cgn2_6/ptodata/l/pna/US6011_COMB . seq : * 

81: /cgn2_6/ptodata/l/pna/US6012_COMB . seq : * 

82 : /cgn2_6/ptodata/l/pna/US6013_COMB . seq : * 

83 : /cgn2_6/ptodata/l/pna/us6014_COMB . seq : * 

84 : /cgn2_6/ptodata/l/pna/us6015_COMB . seq : * 

85 : /cgn2_6/ptodata/l/pna/US6016_COMB . seq : * 

86 : /cgn2_6/ptodata/l/pna/U56017_COMB . seq : * 

87 : /cgn2_6/ptodata/l/pna/US6018_COMB . seq : * 

88 : /cgn2_6/ptodata/l/pna/US6019_COMB .seq : * 

89 : /cgn2_6/ptodata/l/pna/US6020_COMB . seq : * 

90 : /cgn2_6/ptodata/l/pna/US6021_COMB . seq : * 

91 : /cgn2_6/ptodata/l/pna/US6022_COMB .seq : * 

92 : /cgn2_6/ptodata/l/pna/US602 3A^COMB . seq : * 

93 : /cgn2_6/ptodata/l/pna/US602 3B_COMB . seq : * 

94 : /cgn2_6/ptodata/l/pna/US6024_COMB . seq : * 

95 : /cgn2_6/ptodata/l/pna/US602 5_COMB . seq : * 

96 : /cgn2_6/ptodata/l/pna/US6026_COMB . seq : * 
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97 : /cgn2_6/ptodata/l/pna/US6027_COMB . seq : * 

98 : /cgn2_6/ptodata/l/pna/US6028_COMB . seq : * 

99 : /cgn2_6/ptodata/l/pna/US6029_COMB .seq:* 
100 : /cgn2_6/ptodata/l/pna/US6030_COMB . seq : * 
101: /cgn2„6/ptodata/l/pna/US603LcOMB . seq : * 
102 : /cgn2_6/ptodata/l/pna/US6032_COMB . seq : * 
103 : /cgn2_6/ptodata/l/pna/US6033_COMB . seq : * 
104 : /cgn2_6/ptodata/l/pna/US6034_COMB . seq : * 
105 : /cgn2_6/ptodata/l/pna/US6035_COMB . seq : * 
106 : /cgn2_6/ptodata/l/pna/US6036_COMB . seq : * 
107 : /cgn2_6/ptodata/l/pna/US6037_COMB . seq : * 
108 : /cgn2_6/ptodata/l/pna/US6038_COMB . seq : * 
109 : /cgn2_6/ptodata/l/pna/US6039_COMB . seq : * 
110 : /cgn2_6/ptodata/l/pna/US6040_COMB . seq : * 
111: /cgn2_6/ptodata/l/pna/US604LCOMB . seq : * 
112 : /cgn2_6/ptodata/l/pna/US6042_COMB . seq : * 
113 : /cgn2_6/ptodata/l/pna/US6043_COMB . seq : * 
114 : /cgn2_6/ptodata/l/pna/US6044_COMB . seq : * 
115 : /cgn2_6/ptodata/l/pna/US6045_COMB . seq : * 
116 : /cgn2_6/ptodata/l/pna/U56046_COMB . seq : * 
• 117: /cgn2_6/ptodata/l/pna/US6047_COMB.seq:* 
118 : /cgn2_6/ptodata/l/pna/US6048_COMB . seq : * 
119 : /cgn2_6/ptodata/l/pna/US6049_COMB . seq : * 
120 : /cgn2_6/ptodata/l/pna/US6050_COMB . seq : * 
121: /cgn2_6/ptodata/l/pna/US605LCOMB . seq : * 
122 : /cgn2_6/ptodata/l/pna/US6052_COMB . seq : * 
12 3 : /cgn2_6/ptodata/l/pna/us605 3_COMB . seq : * 
124 : /cgn2_6/ptodata/l/pna/US6054_COMB . seq : * 
125 : /cgn2_6/ptodata/l/pna/US6055_COMB. seq : * 
126 : /cgn2_6/ptodata/l/pna/US6056_COMB . seq : * 
127 : /cgn2_6/ptodata/l/pna/us6057_COMB . seq : * 
128 : /cgn2_6/ptodata/l/pna/us6058_COMB . seq : * 
129 : /cgn2_6/ptodata/l/pna/US6059_COMB . seq : * . 

Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 

and is derived by analysis of the total score distribution. 

SUMMARIES 

% 



Result 




Query 








Description 


NO. 


Score 


Match 


Length DB ID 


1 


5024 


100.0 


5024 


46 


US-10-032-256A-1 


Sequence 1, Appli 


2 


4821.6 


96.0 


5026 


62 


US-10-764-420-2502 


Sequence 2502, Ap 


3 


3575.4 


71.2 


3641 


48 


US-10-144-771-15940 


Sequence 15940, A 


4 


3575.4 


71.2 


3641 


106 


US-60-360-207-15940 


Sequence 15940, A 


5 


1754.2 


34.9 


2826 


46 


US-10-032-256A-18 


Sequence 18, Appl 


6 


1751.6 


34.9 


7385 


115 


US-60-452-680-11295 


Sequence 11295, A 


7 


1751 


34.9 


7385 


65 


US-10-932-349-1369 


sequence 1369, Ap 


8 


1751 


34.9 


7385 


120 


US-60-500-337-1369 


Sequence 1369, Ap 


9 


1750 


34.8 


7385 


33 


US-09-732-630-4120 


Sequence 4120, Ap 


10 


1750 


34.8 


7385 


49 


US-10-170-235-33817 


Sequence 33817, A 


11 


1750 


34.8 


7385 


62 


US-10-756-149-1579 


Sequence 1579, Ap 
Sequence 40, Appl 


12 


1714.6 


34.1 


2145 


44 


US-09-979-167-40 


13 


1508 


30.0 


7257 


115 


US-60-452-680-11294 


sequence 11294, A 


14 


1506.4 


30.0 


7257 


49 


US-lO-170-235-32765 


Sequence 32765, A 


15 


1040.2 


20.7 


1343 


48 


US-10-144-771-20325 


Sequence 20325, A 


16 


1040.2 


20.7 


1343 


106 


US-60-360-207-20325 


Sequence 20325, A 


c 17 


766.8 


15.3 


996 


87 


US-60-184-774-251 


Sequence 251, App 


18 


733 


14.6 


1872 


23 


US-09-523-654A-31 


Sequence 31, Appl 


19 


733 


14.6 


1872 


23 


US-09-523-654B-31 


Sequence 31, Appl 


20 


733 


14.6 


1872 


39 


US-09-902-919-31 


Sequence 31, Appl 
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21 


510 


.8 


10. 


2 


142692 


65 


US-10-932-349-19459 


Sequence 


19459, A 


22 


510 


.8 


10. 


2 


142692 


120 


US-60-500-337-19459 


Sequence 19459, A 


c 23 


510 


.6 


10. 


2 


1748349 


41 


US-09-947-914-48 


Sequence 48, Appl 


24 


487 


9. 


7 


546 


44 


US-09-963-353-207 


sequence 


207, App 


25 


487 


9. 


7 


546 


53 


US-10-375-682-207 


Sequence 


207, App 


26 


484 


. 2 


9. 


6 


675 


46 


US-10-029-386-20693 


Sequence 


20693, A 


27 


482 


' 2 


9. 


6 


656 


1 


PCT-USO1-O0663-13551 


Sequence 13551, A 


28 


482 


\ 2 


9. 


6 


656 


1 


PCT-US01-00663-14025 


Sequence 14025, A 


29 


482 


\ 2 


9. 


6 


656 


1 


PCT-US01-00663-14066 


Sequence 14066, A 


30 


482 


\ 2 


9. 


6 


656 


37 


US-09-864-761-17140 


Sequence 


17140, A 


31 


482 


\ 2 


9. 


6 


656 


37 


US-09-864-761-17590 


Sequence 


17590, A 


32 


482 


.2 


9. 


6 


656 


37 


US-09-864-761-17630 


Sequence 


17630, A 


33 


482 


.2 


9. 


6 


656 


49 


US-10-182-993-13122 


Sequence 


13122, A 


34 


482 


.2 


9. 


6 


656 


49 


US-10-182-993-13577 


sequence 


13577, A 


35 


482 


.2 


9. 


6 


656 


49 


US-10-182-993-13617 


Sequence 


13617, A 


36 


482 


.2 


9. 


6 


656 


49 


US-10-182-995-10286 


Sequence 


10286, A 


37 


482 


.2 


9. 


6 


656 


49 


US-10-182-995-10736 


Sequence 


10736, A 


38 


482 


.2 


9. 


6 


656 


49 


US-10-182-995-10776 


Sequence 


10776, A 


39 


482 


.2 


9. 


6 


656 


49 


US-10-182-997-9603 


Sequence 


9603, Ap 


40 


482 


.2 


9. 


6 


656 


49 


US-10-182-997-10072 


Sequence 


10072, A 


41 


482 


.2 


9. 


6 


656 


49 


US-10-182-997-10113 


Sequence 


10113, A 


42 


482 


.2 


9. 


6 


656 


49 


US-10-182-998-5380 


Sequence 


5380, Ap 


43 


482 


.2 


9. 


6 


656 


49 


US-10-182-998-5835 


Sequence 


5835, Ap 


44 


482 


.2 


9. 


6 


656 


49 


US-10-182-998-5875 


Sequence 


5875, Ap 


45 


482 


.2 


9. 


6 


656 


50 


US-10-203-134-13419 


Sequence 


13419, A 



ALIGNMENTS 



RESULT 1 

US-10-032-256A-1 

; Sequence 1, Application US/10032256A 

; GENERAL INFORMATION: 

; APPLICANT: CHODOSH, Lewis A 

; APPLICANT: GARDNER, Heather P 

; TITLE OF INVENTION: HORMONALLY UP-REGULATED, NEU-TUMOR-ASSOCIATED KINASE 

; FILE REFERENCE: 22253-70421 

; CURRENT APPLICATION NUMBER: US/10/032 , 256A 

; CURRENT FILING DATE: 2002-05-21 

; PRIOR APPLICATION NUMBER: 60/257,073 

; PRIOR FILING DATE: 2000-12-21 

; NUMBER OF SEQ ID NOS: 18 

; SOFTWARE: Patentln Ver. 2.1 

; SEQ ID NO 1 

; LENGTH: 5024 

; TYPE: DNA 

; ORGANISM: Murinae gen. sp. 
US-10-032-256A-1 

Query Match 100.0%; Score 5024; DB 46; Length 5024; 

Best Local Similarity 100.0%; Pred. No. 0; 

Matches 5024; conservative 0; Mismatches 0; Indels 0; Gaps 0; 

Qy 1 GCAGGAGGAGCCAGGGCAGCCCCGGGAGCCGGAGGAGGAGCGGCTGCGAGCGCGGGAGCC 60 

I I II II I I I I I I I I I I I II I I I I I I I I I I I I I II I I I I I I I I I I M I I II I I I I I I I I I I 

Db 1 GCAGGAGGAGCCAGGGCAGCCCCGGGAGCCGGAGGAGGAGCGGCTGCGAGCGCGGGAGCC 60 

Qy 61 GAGCGAGCGCGATGCCGGCAGCGGCGGGGGACGGGCTCTTGGGCGAGCCGGCGGCACCGG 120 

II I II II Ml I I I I I Ml I I II Ml I II I I I I II II III I I I II I I I II I II M I I I I M 

Db 61 GAGCGAGCGCGATGCCGGCAGCGGCGGGGGACGGGCTCTTGGGCGAGCCGGCGGCACCGG 120 

Qy 121 GGGGCGATGGAGGCGCGGAGGACACGACCAGGCCGGCGGCGGCCTGCGAGGGAAGTTTCC 180 
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I I ! I I I I II I I I I I I I I I I I I I I I II I I I I I I I I I I I I I M II I I I I I I! M I I I I I II 

GGGGCGATGGAGGCGCGGAGGACACGACCAGGCCGGCGGCGGCCTGCGAGGGAAGTTTCC 



Db 


121 


0* 0*0* 0* 0*0* A **T0* 0* A r* 0* r* r~ r~ 0* 0* A 0*0* A 0* A 0*0* A 0*0* A 0* 0* 0* 0* 0* 0* 0* 0* 0* 0*0* 0* 0*0" 1 0*0*0* A 0*0* 0* A A / "l ■■■ 1 0-0* 

GGGGCGATGGAGGCGCGGAGGACACGACCAGGCCGGCGGCGGCCTGCGAGGGAAGTTTCC 


180 


Qy 


181 


TGCCCGCCTGGGTGAGCGGCGTGTCCCGCGAGCGGCTCCGGGACTTCCAGCACCACAAGC 


240 




1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 




Db 


181 


d^ d^ d** d^ d~* d^ d* 1 1 1 ^ * M l" A d^ d^ d^ d^* d~* d* * 1 '1 ' ^ * M f ^» dk d* m 'V d*** d** d** d*** d*** A d* 1 *l' "1 ' A A A A A * 

TGCCCGCCTGGGTGAGCGGCGTGTCCCGCGAGCGGCTCCGGGACTTCCAGCACCACAAGC 


240 


Qy 


241 


GCGTGGGCAACTACCTCATCGGCAGCAGGAAGCTGGGAGAGGGCTCCTTCGCCAAGGTGC 


300 




1 II 1 II 1 Ml 1 1 II 1 1 1 1 Mill II 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 II 1 1 1 1 1 II II II 1 1 1 1 




Db 


"> A 1 

241 


GCGTGGGCAACTACCTCATCGGCAGCAGGAAGCTGGGAGAGGGCTCCTTCGCCAAGGTGC 


300 


Qy 


301 


GCGAGGGGCTGCACGTGCTGACGGGAGAAAAGGTAGCTATCAAGGTCATCGATAAGAAAA 


360 




1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 II II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 II M 1 1 1 1 1 1 1 II 1 1 1 1 II 1 




Db 


301 


GCGAGGGGCTGCACGTGCTGACGGGAGAAAAGGTAGCTATCAAGGTCATCGATAAGAAAA 


360 


Qy 


361 


GAGCCAAGAAAGACACCTACGTCACCAAAAACCTGCGTCGAGAGGGGCAGATCCAGCAGA 


420 




M II 1 1 1 M 1 1 M M 1 M II II MM 1 1 M M M MM 1 1 1 1 M 1 1 1 M M II II 1 M 1 1 




Db 


361 


GAGCCAAGAAAGACACCTACGTCACCAAAAACCTGCGTCGAGAGGGGCAGATCCAGCAGA 


420 


Qy 


421 


TGATCCGACACCCCAACATCACACAGCTCCTGGACATCTTGGAGACAGAGAACAGCTACT 


480 




1 1 1 1 M 1 1 MM M 1 1 1 1 M M Ml 1 1 1 1 M 1 1 1 1 1 MM M 1 1 1 1 1 1 II 1 M 1 M 1 1 M 




Db 


421 


TGATCCGACACCCCAACATCACACAGCTCCTGGACATCTTGGAGACAGAGAACAGCTACT 


480 


Qy 


481 


ACCTGGTCATGGAGCTGTGTCCTGGTGGCAACCTCATGCACAAGATCTACGAAAAGAAAC 


540 




II M 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 II II 1 1 II 1 1 1 1 1 1 II 1 1 1 1 1 II 1 1 1 M M 1 1 II 1 1 1 1 1 


540 


Db 


481 


ACCTGGTCATGGAGCTGTGTCCTGGTGGCAACCTCATGCACAAGATCTACGAAAAGAAAC 


Qy 


541 


GGTTGGATGAAGCCGAGGCCCGCAGATACATCCGGCAACTCATCTCTGCGGTGGAACACC 


600 




II MM 1 1 II 1 1 1 1 1 1 1 Mill 1 1 II 1 1 1 1 1 1 1 II 1 M 1 1 1 1 II 1 1 1 1 1 1 Mill 1 1 M 




Db 


541 


GGTTGGATGAAGCCGAGGCCCGCAGATACATCCGGCAACTCATCTCTGCGGTGGAACACC 


600 


Qy 


601 


TGCACCGTGCGGGGGTGGTTCACAGAGACTTGAAGATAGAGAATTTGCTACTAGATGAAG 


660 




M 1 1 II II 1 1 II II 1 1 1 1 II II M 1 1 M M II M M 1 II 1 1 M II 1 1 1 II 1 II II II II 1 




Db 


601 


TGCACCGTGCGGGGGTGGTTCACAGAGACTTGAAGATAGAGAATTTGCTACTAGATGAAG 


660 


Qy 


661 


ACAATAATATCAAGCTGATTGACTTTGGCTTGAGCAACTGTGCAGGGATCCTAGGTTACT 


720 




1 1 1 1 1 II II M 1 M 1 1 1 1 M 1 1 II M 1 1 M II 1 1 II M 1 1 1 1 1 II II II 1 1 1 1 1 1 1 1 M 


720 


Db 


661 


ACAATAATATCAAGCTGATTGACTTTGGCTTGAGCAACTGTGCAGGGATCCTAGGTTACT 


Qy 


721 


CGGATCCATTCAGCACACAGTGTGGCAGCCCTGCCTATGCTGCGCCAGAACTGCTTGCCA 


780 




1 M II 1 1 M 1 1 1 1 1 1 II 1 1 III II II III 1 1 III II 1 1 1 1 II 1 II II 1 II II 1 M 1 1 1 


-7 a r\ 

780 


Db 


721 


CGGATCCATTCAGCACACAGTGTGGCAGCCCTGCCTATGCTGCGCCAGAACTGCTTGCCA 


Qy 


781 


GGAAGAAATATGGCCCCAAAATTGATGTCTGGTCAATAGGCGTGAACATGTATGCCATGC 


840 




II 1 1 MM M 1 M 1 1 1 1 MM 1 1 1 1 1 1 1 1 1 1 1 1 1 1 Ml II 1 M II II 1 1 1 MM 1 MM 




Db 


781 


GGAAGAAATATGGCCCCAAAATTGATGTCTGGTCAATAGGCGTGAACATGTATGCCATGC 


o a r\ 

840 


Qy 


841 


TGACGGGGACCCTACCTTTCACTGTGGAGCCTTTCAGCCTGAGGGCTCTGTATCAGAAGA 


900 




1 II IM 1 1 Ml M 1 M M II IMIMM 1 1 1 1 II Ml 1 1 II 1 M II M MM 1 MM 1 M 




Db 


841 


TGACGGGGACCCTACCTTTCACTGTGGAGCCTTTCAGCCTGAGGGCTCTGTATCAGAAGA 


900 


Qy 


901 


TGGTGGACAAAGCAATGAATCCCCTGCCGACCCAGCTCTCCACAGGGGCCGTCAACTTTC 


960 




1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


901 


TGGTGGACAAAGCAATGAATCCCCTGCCGACCCAGCTCTCCACAGGGGCCGTCAAC MIC 


960 


Qy 


961 


TGCGCTCCCTCCTGGAACCAGACCCTGTGAAGAGGCCGAATATCCAGCAAGCGGTGGCGA 


1020 




1 MM MM II II 1 M IMIIIMM 1 1 II II II III MM M Ml III 1 1 III II MM 




Db 


961 


TGCGCTCCCTCCTGGAACCAGACCCTGTGAAGAGGCCGAATATCCAGCAAGCGCTGGCGA 


1020 


Qy 


1021 


ATCGCTGGTTGAATGAGAATTACACTGGAAAGGTGCCCTGCAATGTCACCTATCCCAACA 


1080 




II 1 1 II MM II 1 1 1 IM 1 MIIMI III MIMM MIMI 1 II II III MM Mill 




Db 


1021 


ATCGCTGGTTGAATGAGAATTACACTGGAAAGGTGCCCTGCAATGTCACCTATCCCAACA 


1080 
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Qy 


1081 


GGATTTCTTTGGAAGACCTGAGTCCCAGCGTGGTGCTGCACATGACTGAAAAGCTGGGCT 


1140 




1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


1081 


GGAI MCI T 1 GGAAGACCTGAGTCCCAGCGTGGTGCTGCACATGACTGAAAAGCTGGGCT 


1140 


Qy 


1141 


ATAAGAACAGTGACGTCATCAACACGGTGCTCTCCAACCGCGCCTGCCACATCCTGGCCA 


1200 




1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


1141 


ATAAGAACAGTGACGTCATCAACACGGTGCTCTCCAACCGCGCCTGCCACATCCTGGCCA 


1200 


Qy 


1201 


TCTACTTCCTGTTGAACAAGAAACTTGAGCGCTATTTGTCAGGGAAATCAGATATCCAAG 


1260 




1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


1201 


TCTACTTCCTGTTGAACAAGAAACTTGAGCGCTATTTGTCAGGGAAATCAGATATCCAAG 


1260 


Qy 


1261 


ATAGC ATCTG CTAC AAG AC C C AGCTCTAC C AG AT AG AG AAGTG C AG AG CC AC C AAGG AGC 


1320 




1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 fl 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II II 1 1 1 1 1 1 1 1 1 




Db 


1261 


ATAGCATCTGCTACAAGACCCAGCTCTACCAGATAGAGAAGTGCAGAGCCACCAAGGAGC 


1320 


Qy 


1321 


CCTATGAGGCCTCCCTGGATACCTGGACGAGGGACTTTGAATTCCATGCTGTGCAGGATA 


1380 




1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


1321 


CCTATGAGGCCTCCCTGGATACCTGGACGAGGGACTTTGAATTCCATGCTGTGCAGGATA 


1380 


Qy 


1381 


AAAAGCCCAAAGAACAAGAAAAAAGAGGTGATTTTCTCCACCGTCCGTTTTCCAAGAAGT 


1440 




1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


1381 


AAAAGCCCAAAGAACAAGAAAAAAGAGGTGATTTTCTCCACCGTCCGTTTTCCAAGAAGT 


1440 


Qy 


1441 


TGGACAAGAACCTGCCTTCTCACAAACAGCCATCGCCCTCGCTGATCACACAGCTCCAGA 


1500 




1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


1441 


TGGACAAGAACCTGCCTTCTCACAAACAGCCATCGCCCTCGCTGATCACACAGCTCCAGA 


1500 


Qy 


1501 


GTACCAAAGCCCTGCTCAAAGACAGGAAGGCCTCCAAGTCAGGCTTCCCCGACAAAGATT 


1560 




1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

lllltllllllllllllllllllllllliriltllllllllrllliliiiiilllriiii 


1560 


Db 


1501 


GTACCAAAGCCCTGCTCAAAGACAGGAAGGCCTCCAAGTCAGGCTTCCCCGACAAAGATT 


Qy 


1561 


CCTTCGTCTGCCGCAATC II 1 1 CCGAAAAACCTCTGATTCCAATTGTGTGGCTTCTTCTT 


1620 




II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 


1620 


Db 


1561 


CCTTCGTCTGCCGCAATCTTTTCCGAAAAACCTCTGATTCCAATTGTGTGGCTTCTTCTT 


Qy 


1621 


CCATGGAATTCATCCCTGTCCCACCTCCCAGGACACCAAGGATTGTAAAGAAACTAGAGC 


1680 




1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 


1680 


Db 


1621 


CCATGGAATTCATCCCTGTCCCACCTCCCAGGACACCAAGGATTGTAAAGAAACTAGAGC 


Qy 


1681 


CACACCAACCAGGGCCGGGAAGTGCCAGCATCCTCCCCAAGGAAGAGCCCCTGCTGCTGG 


1740 




1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 




Db 


1681 


CACACCAACCAGGGCCGGGAAGTGCCAGCATCCTCCCCAAGGAAGAGCCCCTGCTGCTGG 


1740 


Qy 


1741 


ATATGGTACGCTCCTTTGAGTCTGTGGATCGAGAGGACCACATAGAACTGCTGTCCCCTT 


1800 




1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


1741 


ATATGGTACGCTCCTTTGAGTCTGTGGATCGAGAGGACCACATAGAACTGCTGTCCCCTT 


1800 


Qy 


1801 


CTCACCATTATAGGATCCTGAGCTCGCCTGTGAGCCTGGCTCGTAGGAATTCTAGTGAGA 


1860 




1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 
i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i ■ i i i i i i i i i i i i i i i i i i i 1 i 




Db 


1801 


CTCACCATTATAGGATCCTGAGCTCGCCTGTGAGCCTGGCTCGTAGGAATTCTAGTGAGA 


1860 


Qy 


1861 


GGACACTCTCCCAGGGGCTGCTGTCCGGAAGTACCTCACCTCTCCAAACTCCACTGCATT 


1920 




1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


1861 


GGACACTCTCCCAGGGGCTGCTGTCCGGAAGTACCTCACCTCTCCAAACTCCACTGCATT 


1920 


Qy 


1921 


CCACGCTGGTCTC 1 1 1 rGCCCACGAAGAAAAGAACAGCCCCCCGAAAGAGGAGGGTGTGT 


1980 




IIIIMIIIMIM 1 IIIIIIIIIIIIMMMIIIIIMIII IMIIIMMIIII III 




Db 


1921 


CCACGCTGGTCTCTTTTGCCCACGAAGAAAAGAACAGCCCCCCGAAAGAGGAGGGTGTGT 


1980 


Qy 


1981 


GTTCACCGCCTCCCGTTCCCAGTAATGGCCTCCTGCAGCCTCTGGGGAGCCCCAACTGTG 


2040 




III MIIIIIMMIIIIIIIIIIMIIMIIIIIIIIIIIIIIIIIIIIIIIIIIIIII 




Db 


1981 


GTTCACCGCCTCCCGTTCCCAGTAATGGCCTCCTGCAGCCTCTGGGGAGCCCCAACTGTG 


2040 
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Qy 


2041 


TGAAGAGCAGGGGACGGTTCCCCATGATGGGCATCGGACAGATGCTGAGGAAGCGGCACC 


2100 




1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 




TGAAGAGCAGGGGACGGTTCCCCATGATGGGCATCGGACAGATGCTGAGGAAGCGGCACC 


2100 


Qy 


2101 


AGAGCCTGCAGCCTTCCTCAGAGAGGTCCCTGGACGCCAGCATGTCCCCTCTGCAGCCCA 


2160 




IIIIIIIMIIIII 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 M 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


2101 


AGAGCCTGCAGCCTTCCTCAGAGAGGTCCCTGGACGCCAGCATGTCCCCTCTGCAGCCCA 


2160 


Qy 


2161 


TAGCCCCCTCCAGCCTCTCCTTTGACATGGCCGACGGTGTCAAGGGCCAGTGTTAACCTG 


2220 




1 II II MM II 1 1 1 1 1 1 ! 1! 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 M 1 1 1 1 1 1 1 




Db 


2161 


A A^ m A^ X^ X"* X** X * | A X^ v"*- — T~ X " 1 X"" y"" fit X*» A X"* A a*+ a** a** X* A^ m A a^a^a*' 1 f X' 1 I X"* A A X"* X™ X** X** A X fc ■ y i^T A A x*» ** * W a** 

TAGCCCCCTCCAGCCTCTCCTTTGACATGGCCGACGGTGTCAAGGGCCAGTGTTAACCTG 


2220 


Qy 


2221 


GGATGGCAAGATTCTGGGTCTCTGTGAGGACAGCCACGGAACAGAGCTCCACACAGGCAG 


2280 




1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 I 1 1 1 1 1 1 1 1 1 I 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


2221 


GGATGGCAAGATTCTGGGTCTCTGTGAGGACAGCCACGGAACAGAGCTCCACACAGGCAG 


2280 


Qy 


2281 


GCACCAGGGCATGGGTGAACAACCTCACGGGAGCATCCTTTATTCTTTTATACCTGCCAC 


2340 




1 1 1 1 1 II 1 1 1 1 1 1 1 1 II II 1 1 1 1 I 1 II 1 1 1 1 II 1 1 1 M 1 1 1 1 1 1 1 1 1 II 1 1 II 1 1 II 1 1 




Db 


2281 


S^- a** A j^— A X^ X"* * a** A** A* * W A** A A A A X*- — t— * ^ i" 1 X** 1 X* A X~" X* A 1" X* X " ■ ■ ■ A ■ ■ /►■T't ' t T" A A X* ■ y— X*» a 

GCACCAGGGCATGGGTGAACAACCTCACGGGAGCATCCTTTATTC 1 1 1 lATACCTGCCAC 


2340 


Qy 


2341 


ACAAAGTCCCACGCTTGTATCAGCTGAAGTCCACACTCAAAGTCCACGCACTTACTTAGG 


2400 




II 1 III 1 1 1 II II 1 1 Ml 1 1 1 1 1 1 1 1 1 1 1 II 1 II 1 1 1 1 M 1 1 1 1 1 1 II III 1 MM II 1 1 




Db 


2341 


ACAAAGTCCCACGCTTGTATCAGCTGAAGTCCACACTCAAAGTCCACGCACTTACTTAGG 


2400 


Qy 


2401 


GACCCTCTGAGACGCTGCCACTAGGGGGAGGGGGAGGGGGCAGACTGTGGGAATCACACC 


2460 




MINI MM MM II M MM MM 1 1 1 M M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 II 1 1 




Db 


2401 


a x* a** a* 1 HI" y A ^ A X* a+- a 0 ** 9 A** A X 1 ** g A A*"* x** X*» A y*> X* x* g+m a ^ X^ X* a+* A** X» A X* A X~ m 1 X » ■ X*» /a. a A a+*> A X** A X* 

GACCCTCTGAGACGCTGCCACTAGGGGGAGGGGGAGGGGGCAGACTGTGGGAATCACACC 


2460 


Qy 


2461 


TTCCAGCCTGAGATTTTCTTTGCTATCACCAATCACTGAGCCCTCTCCAGGATCCCCTCA 


2520 




1 II 1 1 1 1 M 1 II 1 1 1 1 1 II II M 1 M II 1 1 1 M 1 1 1 1 II II 1 1 1 1 1 1 1 1 Ml 1 II M M 




Db 


2461 


■ ■ i a x* 1 y^ At » ■ 1 y*" A A X* ■ ■ ■ a** X ** 1 A A X* X* A A a** A X » ■ A X 1 * A** X * ■ V * ■ X 1 * A X* A** A A+* a** X* a* * W a** A 

TTCCAGCCTGAGA 1 1 1 1 CTTTGCTATCACCAATCACTGAGCCCTCTCCAGGATCCCCTCA 


2520 


Qy 


2521 


GTGGGCTCAGAGCTAAAAACCACACCTCCATCTGCTGGGCCAATCAGATTTCCAGACTGG 


2580 




1 1 1 1 1 1 M 1 1 II 1 II 1 II 1 1 1 1 II 1 1 1 1 1 1 1 M M 1 II 1 II 1 II 1 1 M 1 1 M 1 II M II 1 




Db 


2521 


At i ■ X^ a** a** a* * ■ A** A X* A y™* X » ■ A A A A A X"^ A X 1 * A a* ' '■ a ^ > 1 y i"F y— • A >m a A T" y» A A 1 ■« 1 a+* a** A A > 1 ■ T X** X*** 

GTGGGCTCAGAGCTAAAAACCACACCTCCATCTGCTGGGCCAATCAGATTTCCAGACTGG 


2580 


Qy 


2581 


TACCAGGTTGTCCCTCCCCTCCTCTCTGTGTGTCTCTCACAGTTCTGTAACTGACCGTCA 


2640 




Mill II M 1 1 II M 1 II II II II II II 1 II II M II M M 1 1 1 1 M II 1 1 1 1 1 1 1 1 M 




Db 


2581 


TACCAGGTTGTCCCTCCCCTCCTCTCTGTGTGTCTCTCACAGTTCTGTAACTGACCGTCA 


2640 


Qy 


2641 


GTGGTCAGTTACAGTCTCACGCGGACGTGCCACTCGCTGGTAAGGACGTTCACCCAACCT 


2700 




Ml 1 II 1 II M 1 1 1 Ml 1 1 1 II 1 II II 1 M 1 1 II M 1 1 II M M 1 M 1 II 1 III II II II 




Db 


2641 


GTGGTCAGTTACAGTCTCACGCGGACGTGCCACTCGCTGGTAAGGACGTTCACCCAACCT 


2700 


Qy 


2701 


AGGGATCCCTCTACAGAGGGAAGCAACCCTCCTTTCCCTAACAGTGAGTCCCCACAGAGT 


2760 




M 1 1 III 1 II 1 1 1 1 1 1 II II II Ml 1 II 1 1 M II II 1 II 1 1 1 1 1 M 1 Ml II II M II 1 




Db 


2701 


A j— /* A — V— s~ J > ■■ A A ^ A A~* A** x** A A X* X* A A X* X* Ji»<r X^ X > 1 ■ 11 ""T* X 1 ** X X " ■ A A X 1 A X * ■ X^ A X * 1 X 1 * X^ /* A X^ A X^ A X * ■ 

AGGGATCCCTCTACAGAGGGAAGCAACCCTCCTTTCCCTAACAGTGAGTCCCCACAGAGT 


2760 


Qy 


2761 


GCTGAGTCACAGTGCTGGACCGGGAGGAAGATGGGATGGCGCCTCAGACAGAGATGGAAC 


2820 




II II II MM llllll M IIIMI III Mill MM MM MM Mill II MINI 




Db 


2761 


GCTGAGTCACAGTGCTGGACCGGGAGGAAGATGGGATGGCGCCTCAGACAGAGATGGAAC 


2820 


Qy 


2821 


CCAGCAGCGAGAACCCAGGAGGAAGACGAAGACTCAAACGCTCATTCCTGTGCAACGTTT 


2880 




Ml Ml II 1 II Ml 1 1 Ml IMIIMI MM III Ml Ml II Ml Ml MINIM MM 




Db 


2821 


CCAGCAGCGAGAACCCAGGAGGAAGACGAAGACTCAAACGCTCATTCCTGTGCAACGTTT 


2880 


Qy 


2881 


TGACAGA \ 1 1 1 ICI 1 ICCICICI MCI 1 1 1 ICCCCIGACCM MCMCM 1 1 1 GGGTTGA 


2940 




II MM 1 1 1 II 1 Ml II III 1 Ml 1 Ml Mill 1 1 M 1 1 1 M 1 1 1 II MM MM II II 




Db 


2881 


TGACAGA M 1 M C M 1 CC 1 C 1 C 1 M C 1 M M CCCC TGACC7 T T TCTTCI 1 1 ! 1 GGGTTGA 


2940 


Qy 


2941 


AACTTGCTGAGGATTGAACGAACTTGTCCAAAGAGATCTTTCTTTATATGAAGTCATTAA 


3000 




II II II MM llllll IMIIMI IMIIMI II MM MM IIIMI II MM llllll 
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Db 


2941 


Qy 


3001 


Db 


3001 


Qy 


3061 


Db 


3061 


Qy 


3121 


Db 


3121 


Qy 


3181 


Db 


3181 


Qy 


3241 


Db 


3241 


Qy 


3301 


Db 


3301 


Qy 


3361 


Db 


3361 


Qy 


3421 


Db 


3421 


Qy 


3481 


Db 


3481 


Qy 


3541 


Db 


3541 


Qy 


3601 


Db 


3601 


Qy 


3661 


Db 


3661 


Qy 


3721 


Db 


3721 


Qy 


3781 


Db 


3781 


Qy 


3841 


Db 


3841 


Qy 


3901 
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TTAAAI I I I I I I I I I AAAGACAGGGTCTCATTAAGTAGCCCAAGCTGGCTTCAAACTCAT 3060 

Mil I I 1 1 MM 1 1 1 II I Mill Mill I llllll 1 1 II 1 1 1 1 1 1 II 111 111 1 1 Ml 1 1 

TTAAAI ! I I I I ! ! I 1 AAAGACAGGGTCTCATTAAGTAGCCCAAGCTGGCTTCAAACTCAT 3060 
GATCCTCCTGCCTCAGCCTCCAAAGTGCTGAGATTACAAGTATATACCCGTGTCTGGCTC 3120 

M MINI Ml I II 1 1 II III MM MIIIMIIIII Mill II I Ml I II MINI II I 

GATCCTCCTGCCTCAGCCTCCAAAGTGCTGAGATTACAAGTATATACCCGTGTCTGGCTC 3 120 
AAAATAGCAATTCAAAAACAAAAACTAGTTGGCCAGATGAAAAGTAGTTTTACCAAATTC 3 180 

I I 1 1 1 1 I I I I 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 II I II 1 1 1 1 1 1 1 1 1 1 1 II I M 1 1 1 1 1 II 

AAAATAGCAATTCAAAAACAAAAACTAGTTGGCCAGATGAAAAGTAGTTTTACCAAATTC 3 180 
ACGTG1 I I I IGI I I I ICTGAGAGGCTGCAGCTCAGATGGCCAAAAAGCTGGCAACAGGAG 32 40 

MM I I II I I I I I II I I I III I I III II I II I I I I II II I I I 11 I MM I I II I 111 II I 

ACGTG \ I n IGI I I I ICTGAGAGGCTGCAGCTCAGATGGCCAAAAAGCTGGCAACAGGAG 3240 

GACCACAGTGGCGTGCCTGCCTAAGGGATAGTAGCCTAGCCATCCTGTGTTTATACCGTG 3 300 

I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
GACCACAGTGGCCTGCCTGCCTAAGGGATAGTAGCCTAGCCATCCTGTGTTTATACCGTG 3300 

GCAGCAGCAGAAGGCATAGAACTTAGCTCCAGATGGCTCTGGAGAGAGAGAAAGGATTCT 3360 

llllll II II II 1 1 II 1 1 II II 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 II 1 1 1 M 1 1 III 1 1 1 

GCAGCAGCAGAAGGCATAGAACTTAGCTCCAGATGGCTCTGGAGAGAGAGAAAGGATTCT 3360 
TAAAGCAGAGTTGAGACAGCAAGAAGCAGGGAATTCGCTGTGTCATGCTGTTCTGCCGTG 3420 

III II III MM I II III II Mill III lllllll III MM II II Mill II III MM 

TAAAGCAGAGTTGAGACAGCAAGAAGCAGGGAATTCGCTGTGTCATGCTGTTCTGCCGTG 3420 
GTTAGAACTTAGCTGTTCTGCTGGGAGCTAGGAGCAGGCTTGCCGCCCCCTGGGAACACG 3480 

M MM I I I I I I I I I I I I llllll I MM I II I I I I I II I I II II I I I I I I III M MM 

GTTAGAACTTAGCTGTTCTGCTGGGAGCTAGGAGCAGGCTTGCCGCCCCCTGGGAACACG 3480 
CTCACAAGACGGTTCGTCCCCAAAGGAAACAGTGCCCCCCAAACAGGCTTTCAGTCCACT 3 540 

MMIMIMM I II I II 1 1 1 II II 1 1 1 III I Ml II MM 1 1 1 II 1 1 1 1 1 1 1 III I II I 

CTCACAAGACGGTTCGTCCCCAAAGGAAACAGTGCCCCCCAAACAGGCTTTCAGTCCACT 3 5 40 
CTGTAATCTGCACCTTCCCCTCCAGGATTGAACCAAAGATGCATTTCCGGTTTTGTGACT 3600 

II 1 1 1 II I 1 1 II II I M I 1 1 1 1 II 1 1 1 II II 1 1 1 1 II II I II I II I 1 1 1 1 II I III 1 1 1 I 

CTGTAATCTGCACCTTCCCCTCCAGGATTGAACCAAAGATGCATTTCCGGTTTTGTGACT 3600 
GTGCCACTCTGTGTGTCTCTTGTGGAACCTGGTGTTGTCTGATCCTGTCCGGCTGGCGCT 3660 

MMIMIMM MM II II llllll llllll MMMIMIMM II IMIMIII III 

GTGCCACTCTGTGTGTCTCTTGTGGAACCTGGTGTTGTCTGATCCTGTCCGGCTGGCGCT 3660 

GGATGGAGGACTGTCTCTGTGTGCATCGTGGGCCCTGGTACTTAGCAGAGGACAAAGGGT 3720. 

II II II II II II II II II II II II II II II II II II II II II II I II II II II II II II I 
GGATGGAGGACTGTCTCTGTGTGCATCGTGGGCCCTGGTACTTAGCAGAGGACAAAGGGT 3720 

ACTGTTGTCAGGAGGGGAAGACTTGGCACGGGCTGGACCACAGTTAGTTTAGAAGTTATG 3780 

II II II II II II II II II II II II II II II II I II II II II II II I II II II II II II II 
ACTGTTGTCAGGAGGGGAAGACTTGGCACGGGCTGGACCACAGTTAGTTTAGAAGTTATG 3780 

G AAC AGCTC AGAATCTTCTG GTCTTTG ACTATTTC AGATGG G GTC AGAGAC CAG AGCTGT 3840 

I I I 1 1 I I I I I 1 1 I I I I I I 1 1 1 1 I I 1 1 1 1 I 1 1 I I I I 1 1 I I I I I E I I I 1 1 I I 1 1 1 1 1 1 I I I I 

GAACAGCTCAGAATCTTCTGGTCTTTGACTATTTCAGATGGGGTCAGAGACCAGAGCTGT 3 840 
AGCCAGGAAGCCAGGTTCATCATCTTGGTCCATCGATTCTAAAGTGGGCAAATTTCTGTG 3900 

MM Ml I II III I I II MUM II II Mill II I I I I Ml M M MM MM MUM 

AGCCAGGAAGCCAGGTTCATCATCTTGGTCCATCGATTCTAAAGTGGGCAAATTTCTGTG 3900 
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1 1 1 1 1 1 1 1 1 I I I 1 1 1 I 1 1 I M I I 1 1 1 I I I I 1 1 1 1 I 1 1 I 1 1 1 1 I I 1 1 I I 1 1 1 1 1 I 1 1 1 1 I I 

ACGTCACAAAGCCGGCCTTTGCCAGTGAGGGCTGAGACACAGTACAACTGCCTCTCATTT 



Db 


3901 


ACGTCACAAAGCCGGCX 1 1 ! GCCAGTGAGGGCTGAGACACAGTACAACTGCCTCTCA 1 1 1 


3960 


Qy 


3961 


ACTGGTGGCAGGCGGCTTCC 1 1 1 GGCCTCTCAGAGCTCTGACTGAACTAGAAGAGAACAC 


4020 




1 1 1 1 1 1 1 1 1 1 1 1 III 1 1 1 1 II 1 1 1 II II 1 II Ml II 1 1 II II 1 1 1 1 II II 1 1 1 1 1 1 1 1 1 




Db 


3961 


ACTGGTGGCAGGCGGCTTCCTTTGGCCTCTCAGAGCTCTGACTGAACTAGAAGAGAACAC 


4020 


Qy 


4021 


GGATTTGGCTGACCCTGGAAGAAAGCTGCTCTAGTCCTGGCTGAATTTGGTAAGACCTGG 


4080 




1 1 1 II 1 1 1 1 1 1 Ml 1 1 1 1 II 1 1 1 1 1 1 MM II 1 1 Mill 1 1 1 III 1 MM III 1 II MM 




Db 


4021 


GGATTTGGCTGACCCTGGAAGAAAGCTGCTCTAGTCCTGGCTGAATTTGGTAAGACCTGG 


4080 


Qy 


4081 


ACTACTTAAACCTTAGGGAGGGACTGACTCCCTCCCGAGGACCCATTACAGGAGGAGGCC 


4140 




1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I 1 1 1 1 1 1 1 1 i 1 1 1 1 1 1 1 




Db 


4081 


ACTACTTAAACCTTAGGGAGGGACTGACTCCCTCCCGAGGACCCATTACAGGAGGAGGCC 


4140 


Qy 


4141 


AGGCTTTTCTCCCAGAGCTGATGGTGTTCTTCATTCAGCATGGCTTCCGTTCAGCTCCCA 


4200 




1 II 1 1 M M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 i 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 M 




Db 


A 1 A "1 

4141 


AGGC II 1 1 CTCCCAGAGCTGATGGTGTTCTTCATTCAGCATGGCTTCCGTTCAGCTCCCA 


4200 


Qy 


4201 


GGACTTGACACTGAAAATAGAACTCTTTAAGCAGAGAGAAGAGGAGAACCATCCACAGAC 


4260 




M i M M M M 1 M 1 1 MM 1 1 1 Ml 1 II 1 1 1 M III 1 1 III 1 1 M 1 1 M 1 II 1 II 1 1 1 1 




Db 


4201 


GGACTTGACACTGAAAATAGAACTCTTTAAGCAGAGAGAAGAGGAGAACCATCCACAGAC 


4260 


Qy 


4261 


GCTCCCCGTATTTGATGTGACGTGTTTGAGCTTTGACGGGTGAAGAGTCCTTTTAAAAGA 


4320 




II MM III M 1 MM II 1 MM III M M MM M M II M M 1 M MM M 1 M Ml 




Db 


4261 


a** a** ■ ^ » i av ^r^r^r a ^ a* ■ ■' a > * ■ ■ ■ ■ a ^ m a** m m m a*+ a a** a** a* * ■ ^» a a ^* a ■ a^ aa+ m a a a a a*+ a 

GCTCCCCGTATTTGATGTGACGTGTTTGAGCTTTGACGGGTGAAGAGTCCl I I lAAAAGA 


4320 


Qy 


4321 


TAACTGCCAGCTGCAGGCATCTGGCTCTGCAAAGCTGGTAGGATGTGTACCTGTGTACTG 


4380 




II II II MM 1 II M 1 II II II 1 III II M II M II II II 1 1 MIMMIMMM MM 




Db 


4321 


TAACTGCCAGCTGCAGGCATCTGGCTCTGCAAAGCTGGTAGGATGTGTACCTGTGTACTG 


4380 


Qy 


4381 


TGCCCGCCCCCTTTCTCCTAGCCCTTTATGTC 1 1 1 1 I CTGACTGTTTGCTTTTCTCGTAT 


4440 




II 1 II MM II 1 III 1 1 1 M 1 1 M II Ml MM Mill 1 1 II II 1 III Mill 1 1 1 II II 




Db 


4381 


TGCCCGCCCCCTTTCTCCTAGCCCTTTATGTC MM! CTGACTGTTTGC MM CTCGTAT 


4440 


Qy 


4441 


GTATGTGTGCCTGTGTTGGTGCGAGCCTGTGGAGAAAGAGTCTCCCATCCTTCAAATGCT 


4500 




1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


4441 


GTATGTGTGCCTGTGTTGGTGCGAGCCTGTGGAGAAAGAGTCTCCCATCCTTCAAATGCT 


4500 


Qy 


4501 


TCGAGAACAGCGTCAGATGTACAACTAGTTTGCCTGCGTTGCTACTGGTACCTTGGACTC 


4560 




1 II 1 1 1 M III II 1 1 II 1 II III 1 M 1 1 1 1 1 1 M 1 1 1 1 1 1 II 1 1 1 1 1 1 II 1 II 1 1 1 1 1 II 




Db 


A r n *i 

4501 


TCGAGAACAGCGTCAGATGTACAACTAGTTTGCCTGCGTTGCTACTGGTACCTTGGACTC 


4560 


Qy 


4561 


TGAACTCAGGTTACCCACCTGAGTCCTCAGTAGGCAGTGGACCCATTGAGAGGCAAATGA 


4620 




M II M II II 1 II 1 1 M 1 1 1 II 1 II 1 1 1 1 M M 1 M 1 1 1 1 1 1 M 1 1 1 11 1 1 1 II 1 INI 




Db 


4561 


TGAACTCAGGTTACCCACCTGAGTCCTCAGTAGGCAGTGGACCCATTGAGAGGCAAAJGA 


4620 


Qy 


4621 


GAACAGGAGGGAGACAAGCTGTGTTCTGGGGCGCACATAAACACCTGACAGACGAGTCTA 


4680 




II M M IM M MM 1 II 1 M MM 1 MM Ml 1 Mill Ml II 1 IIIIIMM MIMI 




Db 


4621 


GAACAGGAGGGAGACAAGCTGTGTTCTGGGGCGCACATAAACACCTGACAGACGAGTCTA 


4680 


Qy 


4681 


GGAAACCGCGTGAAAGAAGAAATGTTAAATTCTTTATTG 1 1 1 1 ATTATATTTATATGGAA 


4740 




1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I 1 1 1 1 I 1 1 1 1 I 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 




Db 


4681 


GGAAACCGCGTGAAAGAAGAAATGTTAAATTCTTTATTG MM ATTATATTTATATGGAA 


4740 


Qy 


4741 


AATGTGGCTATCCTTTTGTTAAGTGCAGAGTGTATTGTCTGTTTGACCCATGACTGTCCT 


4800 




M M M M 1 1 1 M 1 1 M 1 M M M 1 M M 1 M III 1 1 1 1 1 1 Ml 1 II IMIMI 1 II 1 1 




Db 


4741 


AATGTGGCTATCC 1 1 1 I GTTAAGTGCAGAGTGTATTGTCTGTTTGACCCATGACTGTCCT 


4800 


Qy 


4801 


TCATGAATGAGTCTTTGCCTGTGATTCTAGTCAGCCTGTGGCTACTGATGGGAACGGCCG 


4860 




II M MIM M MM MMI Ml M 1 II IMM M IM MM M MM II 1 1 1 MM II 1 




Db 


4801 


TCATGAATGAGTCTTTGCCTGTGATTCTAGTCAGCCTGTGGCTACTGATGGGAACGGCCG 


4860 
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Qy 


4861 


ATCTGTCATCATGTGAAGTCCAGGAGGAAGAATCTA 1 1 1 1 AGTCATACGATTTGGTCATG 


4920 




1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 

1 1 J 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 i 1 1 1 1 1 1 1 1 I 1 1 1 1 I 1 1 1 1 1 1 1 1 1 1 




Db 


4861 


ATCTGTCATCATGTGAAGTCCAGGAGGAAGAATCTAI 1 1 1 AGTCATACGATTTGGTCATG 


4920 


Qy 


4921 


AGTAAGGACTATATTTATGTCACCACTATTGAATATATGTACTTTTATAATGGCTGTGAA 


4980 




IIIIMIIIMIIIIIIIMIIIIIIMIIIIIIIIIIIMIIIIIIIIIIIIIIIIIII 




Db 


4921 


AGTAAGGACTATATTTATGTCACCACTATTGAATATATGTAC 1 1 1 1 ATAATGGCTGTGAA 


4980 


Qy 


4981 


ATACACTTTTTCCTCACAAAAAAAAAAAAAAAAAAAAAAAAAAA 5024 






1 1 1 1 1 1 1 1 1 1 1 1 I 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 




Db 


4981 


ATACACTTTTTCCTCACAAAAAAAAAAAAAAAAAAAAAAAAAAA 5024 
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GenCore version 5.1.6 
Copyright (c) 1993 - 2005 Compugen Ltd. 



OM nucleic - nucleic search, using sw model 



Run on: 



Title: 

Perfect score: 
Sequence: 



April 20, 2005, 00:03:12 ; Search time 1402 Seconds 

(without alignments) 
5636.860 Mil lion cell updates/sec 

US-10-032-256A-1 
5024 

1 gcaggaggagccagggcagc aaaaaaaaaaaaaaaaaaaa 5024 



18334066 



scoring table: IDENTITY_NUC 

Gapop 10.0 , Gapext 1.0 

Searched: 9167033 seqs, 786512495 residues 

Total number of hits satisfying chosen parameters: 

Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 

Post-processing: Minimum Match 0% 

Maximum Match 100% 
Listing first 45 summaries 

Database : Pending_Patents_NA_New:* 

1: /cgn2_6/ptodata/l/pna/PCT_NEW_COMB . seq : * 

2 : /cgn2_6/ptodata/l/pna/PCT_NEW_COMB . seq2 : * 

3 : /cgn2_6/ptodata/l/pna/US06_NEW_COMB . seq : * 

4 : /cgn2_6/ptodata/l/pna/US07_NEW_COMB . seq : * 

5 : /cgn2_6/ptodata/l/pna/US08_NEW_COMB . seq : * 

6 : /cgn2_6/ptodata/l/pna/US09_NEW_COMB . seq : * 

7 : /cgn2_6/ptodata/l/pna/USlO_NEW_COMB . seq : * 

8 : /cgn2_6/ptodata/l/pna/USlO_NEW_COMB . seq2 : * 

9 : /cgn2_6/ptodata/l/pna/USll_NEW_C0MB . seq : * 

10 : /cgn2_6/ptodata/l/pna/USll_NEW_COMB . seq2 : * 

11: /cgn2_6/ptodata/l/pna/usi:LNEW_COMB . seq3 : * 

12 : /cgn2_6/ptodata/l/pna/US60 JslEW_COMB . seq : * 

Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 

SUMMARIES 



Result 
NO. 


Score 


Query 
Match 


Length DB 


ID 






Description 


1 


171.2 


3.4 


472 


10 


US- 


-11-060-756- 


■1385 


Sequence 1385, Ap 


2 


171.2 


3.4 


472 


10 


US- 


-11-060-756- 


■5657 


Sequence 5657, Ap 


3 


155 


3.1 


1975 


2 


PCT- 


-US04-42316- 


•17 


Sequence 17, Appl 


4 


148.8 


3.0 


2110 


2 


PCT- 


-US04-42316- 


25 


Sequence 25, Appl 


5 


148.6 


3.0 


2022 


2 


PCT- 


-US04-42316- 


■15 


Sequence 15 , Appl 


6 


142.4 


2.8 


1993 


2 


PCT- 


-US04-42316- 


■9 


Sequence 9, Appli 


7 


142.4 


2.8 


2023 


2 


PCT- 


-US04-42316- 


-13 


Sequence 13, Appl 


8 


141.6 


2.8 


1885 


2 


PCT- 


-US04-42316- 


■19 


Sequence 19, Appl 


9 


140 


2.8 


1838 


2 


PCT- 


-US04-42316- 


-23 


Sequence 23, Appl 


10 


138.6 


2.8 


600 


10 


US- 


-11-060-756- 


-1783 


Sequence 1783, Ap 


11 


138.6 


2.8 


600 


10 


US- 


-11-060-756- 


-1784 


Sequence 1784, Ap 
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12 


138.6 


2.8 


13 


138.6 


2.8 


14 


131 


2.6 


15 


130.8 


2.6 


16 


130.8 


2.6 


17 


129.4 


2.6 


18 


129.2 


2.6 


19 


127.4 


2.5 


20 


124.4 


2.5 


21 


122.8 


2.4 


22 


122.8 


2.4 


23 


121.2 


2.4 


24 


120.6 


2.4 


25 


119.6 


2.4 


26 


119.2 


2.4 


27 


118.2 


2.4 


28 


113.2 


2.3 


29 


111.8 


2.2 


30 


109.8 


2.2 


31 


109.8 


2.2 


32 


109.6 


2.2 


33 


103 


2.1 


34 


102 


2.0 


35 


100.8 


2.0 


36 


100.8 


2.0 


37 


99.8 


2.0 


38 


98.8 


2.0 


c 39 


98.4 


2.0 


40 


98.2 


2.0 


41 


98.2 


2.0 


42 


98.2 


2.0 


43 


96.6 


1.9 


44 


96.6 


1.9 


45 


95.2 


1.9 
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600 10 US-11-060-756-6055 

600 10 US-11-060-756-6056 

1766 2 PCT-US04-42316-50 

3226 12 US-60-660-590-61 

4917 7 US-10-204-639-87 

1563 2 PCT-US04-42316-1 

3609 12 US-60-660-590-62 

4723 12 US-60-646-841-19 

1235 2 PCT-US04-42316-32 

2524 2 PCT-US04-42316-3 

2530 2 PCT-US04-42316-5 

2083 2 PCT-US04-42 316-40 

2959 2 PCT-US04-42360-2300 

1700 2 PCT-US04-42316-7 

1840 2 PCT-US04-42 316-36 

2986 7 US-10-524-979-1 

756 7 US-10-524-979-5 

2449 2 PCT-US04-42316-42 

2470 2 PCT-US03-27894-21 

2501 7 US-10-955-054A-138 

1124 2 PCT-US04-42316-38 

1788 2 PCT-US04-42316-44 

3010 2 PCT-US04-23309-57 

1400 10 US-11-060-756-720 

1400 10 US-11-060-756-4992 

2729 2 PCT-US04-23309-36 

4836 2 PCT-US04-23309-16 

13361 7 US-10-472-963-1665 

1400 10 US-11-060-756-3835 

1400 10 US-11-060-756-8107 

3437 7 US-10-488-292-1 

1104 7 US-10-259-740A-1 

9862 7 US-10-259-740A-3 

1402 10 US-11-059-535-2112 



Sequence 6055, Ap 
Sequence 6056, Ap 
Sequence 50, Appl 

Sequence 61, Appl 
Sequence 87, Appl 
Sequence 1, Appli 
Sequence 62, Appl 
Sequence 19, Appl 
Sequence 32, Appl 
Sequence 3, Appli 
Sequence 5, Appli 
Sequence 40, Appl 
Sequence 2300, Ap 
Sequence 7, Appli 
Sequence 36, Appl 
Sequence 1, Appli 
Sequence 5, Appli 
Sequence 42, Appl 
Sequence 21, Appl 
Sequence 138, App 
Sequence 38, Appl 
Sequence 44, Appl 
Sequence 57, Appl 
Sequence 720, App 
Sequence 4992, Ap 
Sequence 36, Appl 
Sequence 16, Appl 
Sequence 1665, Ap 
Sequence 3835, Ap 
Sequence 8107, Ap 
Sequence 1, Appli 
Sequence 1, Appli 
Sequence 3, Appli 
Sequence 2112, Ap 



ALIGNMENTS 



RESULT 1 

US-11-060-756-1385 

; Sequence 1385, Application US/11060756 
; GENERAL INFORMATION: 

; APPLICANT: Wyeth 

; APPLICANT: Mounts, William Martin 

; TITLE OF INVENTION: Nuclei c Acid Arrays for Monitoring Expression Profiles 
Drug 

; TITLE OF INVENTION: Target Genes 

; FILE REFERENCE: AM101083. (031896-042000) 

; CURRENT APPLICATION NUMBER: US/11/060 , 756 

; CURRENT FILING DATE: 2005-02-18 

; NUMBER OF SEQ ID NOS: 303284 

; SOFTWARE: Patentln version 3.2 

; SEQ ID NO 1385 

; LENGTH: 472 

; TYPE: DNA 

; organism: Homo sapiens 
US-11-060-756-1385 

Query Match 3.4%; Score 171.2; DB 10; Length 472; 

Best Local Similarity 87.0%; Pred. No. 4.3e-21; 

Matches 188; Conservative 0; Mismatches 28; Indels 0; Gaps 0 
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331 AGGTAGCTATCAAGGTCATCGATAAGAAAAGAGCCAAGAAAGACACCTACGTCACCAAAA 390 

1 1 1 1 II II II Mill MINIM MINIM II 1 1 1 I I 1 1 1 MINIMI 

257 AGGTGGCCATAAAAGTCATTGATAAGAAGAGAGCCAAAAAGGACACCTATGTCACCAAAA 316 
391 ACCTGCGTCGAGAGGGGCAGATCCAGCAGATGATCCGACACCCCAACATCACACAGCTCC 450 

MINI MIIIMI I I I I II 1 1 1 1 I I I II M I 1 1 llllllll Mill 1 1 1 1 1 1 1 

3 17 ACCTGCGGCGAGAGGGTCAGATCCAGCAGATGATCCGCCACCCCAATATCACTCAGCTCC 376 
451 TGGACATCTTGGAGACAGAGAACAGCTACTACCTGGTCATGGAGCTGTGTCCTGGTGGCA 5 10 

I II II II II II II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 Mill MM 

377 TTGATATTTTAGAAACGGAAAACAGCTACTACCTGGTCATGGAGCTGTGCCCTGGGGGCA 436 

511 ACCTCATGCACAAGATCTACGAAAAGAAACGGTTGG 546 

MM I I I I I I I I I I I I N II Mill Ml Ml 
437 ACCTGATGCACAAGATCTATGAGAAGAAGCGGCTGG 472 
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